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Chapter 1: Introduction 

1.1 Notation 

This document defines structure and enumeration data tv-pes for use by system software. An 
enumeration definition names the enumeration and the members, gives the integral value of each 
member, and may give one or more columns of commentary. 

A structure definition names the structure (typically of a hardware register) and describes the 
fields. Fields are written using the notation "Bits hbn-lbn'' where hbn is the high bit number and 
Ibn is the low bit number. The width of the field is hbn - /6n + 1. Each field has a field name, a 
type, and one or more columns of commentary' text. The field type is either a predefined type or 
aji enumeration type declared elsewhere. 

The enumeration names, enumeration members, structure names, field names, aind base types all 
appear in the index. The enumerations and structures defined in this manual are available for use 
in assembly language and C programs, including the assembler and compilers themselves. 

The notation used for operations and instructions is described in §11.1. 

Some descriptions in this document include program fragments. Fragments are formatted so that 
keywords appear in bold face and comments appear in italics. 

1.2 Data Types 

The memor\' system can load and store eight-bit bytes, 16-bit quarterwords (2 bnes). 32-bit half- 
words (4 bytes), or 64-bit words (8 bytes). Bits are numbered from right to left: the least significant 
bit is bit number 0. 

The most important architecturally supported data types are these: 
bit vector 

A bit vector may be of any length and may span one or more word boundaries, 
signed integer 

Signed integers are interpreted in two's complement. Byte, quarterword, and halfword signed 
integers are sign-extended to 64 bits when they are loaded and quietly truncated to the proper 
length when they are stored. 

unsigned integer 

Byte, quarterword, and halfword unsigned integers are zero-extended to 64 bits when they 
are loaded and quietly truncated to the proper length when they are stored. 

floating point 

Floating-point numbers and operations conform to IEEE Standard 754, Single (32-bit) and 
double (64-bit) basic formats are supported. Support for a 128-bit floating-point format is 
also provided. 



1.2 Data Types 



10 



P — -issor Principles of Operation 



pointer 

A pointer has two subfields. The most significant 16 bits is the access control field, described 
m ^6.1. The remaining 48 bits make up the address field, described in §6.2. 

instruction 

Instructions, composed of operations, are described in §3. 
stream status word 

A stream status word (ssw), contains status and control information for the instruction stream 
m Its upper halfword and a program counter in the lower halfword. It is described in §2.1. 
resource counter 

detribriHir^'' "^""'^ accounting and performance monitoiing. They are 

1 is true The name of each denved type is a mnemonic to help interpret what the bit controls 
when act:v^naj^ely when it is set, is true, or is assigned 1. all of which are equivalent terms For 
examp e, a vanable of type Flag notes that an exception has occurred if it is set: a variable^ t^? 
SignBit mdicates a negative number if it is set. 



type ^ 


width(bits) 


base type 


description 


Reg 


5 


Uns 


a register number 


ProgramAddrUns 

PageNumber 

ProgFrame 


32 
20 

17 


Uns 
Uns 

Uns 


a ProgramAddress structure treated as an Uns: see §2.1 
a virtual program page address 
a physical memory offset 


DataAddrUns 
DataSegment 
SegmentOflfset 
DataFrame 


48 
20 
15 
19 


Uns 
Uns 
Uns 
Uns 


a DataAddress treated as an Uns; see §6.1 

a virtual data segment number 

an oiFset into a virtual data segment 

a physical memory offset specified as a frame number 

or a physical memory frame number 



1.3 Storage Classes 

Each stream has available a number of different kinds of storage. 

° nrt'l!! ^ °^ ^ P°^«^*^^^y available to any stream on anv 

Zl Z■\ ^ "T"^- "^'^"^'^ ^'^j^^^^^ stream^s processor 

llel r '"^ "^^^^^y to as "local" and is curxeBtlv 

used onl> to store mstructions, data maps, and program maps for the local processor. Mos't 
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data memory accesses are distributed across the entire system. The part of data memory that 
stores insinictions for its processor, is sometimes called ' program memory". Every word in 
data memory has a four-bit access state, which modifies the behavior of memory references 
to any part of the word: see §6.1. 

• The 31 general-purpose registers are used as the sources and destination for almost all oper- 
ations. Register 0 always reads as 64 bits of 0, and values written into it are discarded. 

• The stream status word (ssw) contains condition codes, the trap mask, the mode, and the 
program counter. The ssw is described in §2,1. 

• The eight target registers contain program addresses and are used as arguments for branch 
operations: Target 0 points to the trap handler. See §2.2. 

• The exception register flags the exception(s) that have been detected and raised. A raised 
exception w^ill cause a trap if the trap is not disabled by the appropriate bit in the trap mask 
of the stream status word. The exception register also contains the register poison flags. See 
§9.1. 

• The result code register describes exceptional result values from the function units: see §9.1. 

• The trap registers are used by the trap handler to save the state of the trapping stream. The 
trap registers are described in §9.2. 
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Chapter 2; Streams 

Each physical processor supports a variable number of instruction streams, or streams for short 
Each stream appears to be (and is programmed like) a wide-instruction RISC processor The 
processor hardware selects streams for execution and executes a single instruction from each in turn 
Streams are allocated, created, and destroyed dynamically: the active streams are multiplexed bv 
the processor hardware onto a single set of pipelined functional units. 

Streams may be active or idle. Aji active stream competes with other streams to issue instructions, 
while idle streams do not. A stream is activated and initialized with a skeleton execution envi- 
ronment by the unprivileged STREAM.CREATE operation. Unprivileged STREAM-RESERVE 

?^T1tv tk ctI ^A^^rnT-rJ °^ "'"^^ subsequent activation by STREAM.- 

i^HLALL. Ihe bTREAM-QDIT operation returns a stream that executes it to the idle state. 

A stream executes at one of four privUege levels: user, supervisor, kernel, or IPL. The piivUege 
level of a stream determines the operations it may execute and the kinds of memory access it is 
permitted. Levels are described further in §8.1. 

Each active stream in a processor belongs to one of sixteen protection domains. A protection 
domam has registers that limit the number of streams it can contain and define the memory 
accesses available to its streams. Protection domains are described in more detail in §8 2. 



2.1 Stream Status Word 



The stream status word (ssw) is shown below. The ssw contains the condition codes from the 
most recent four ".TEST" operations; a trap mask which selectively disables traps from raised 
exceptions: a mode field describing how arithmetic, memory references, and lookahead are to be 
done; and a program counter containing the address of the instruction being executed. 
Bits Wd Field 



Type 



StreamStatusWord: Condition Vector 

63-61 3 ccJJ CondCode 



60-58 3 cc_2 



57-55 3 cc-1 



54-52 3 ccO 



CondCode 



CondCode 



CondCode 



Description 



condition code cvy, result from fourth 
most recent .TEST operation; see §4 
condition code CV2: result from third 
most recent .TEST operation; see §4 
coridition code cvj: result from sec- 
ond most recent .TEST operation; see 
§4 

condition code cvq: result from most 
recent .TEST operation; see §4 
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StreamStatusWord: Trap Mask 



51 


1 


hardware-trap.- 


Boolean 


disable hardware traps 






disable 




50 


1 


system.trap.- 


Boolean 


disable system traps 






disable 




49 


1 


domain-sigDaL- 


Boolean 


disable domain signal traps 






trap.disable 




48 


1 


user.trap^disable 


Boolean 


disable user traps 


47-4: 


5 3 


0 




reserved 


44 


1 


floatJnvalid.- 


Boolean 


disable float invalid trap 






t rap -disable 




43 


1 


float ^ero-div-- 


Boolean 


disable float zero divide trap 






trap.disable 




42 


1 


float-overflow.- 


Boolean 


disable float overflow trap 






trap.disable 




41 


1 


float.underflow-- 


Boolean 


disable float underflow trap 






trap-disable 




40 


1 


floatJnexact-- 


Boolean 


disable float inexact trap 






trap.disable 





StreamStatusWord: Mode 



39 
38 



37 
36 

35 
34 



ssw-override 



Boolean 



specJoad-enable Boolean 

unaligned-data-- Boolean 
enable 

lookahead-disable Boolean 



1 count-disable 



Boolean 



33-32 2 round-mode RoundMode 
StreamStatusWord: Program Counter 

31-0 32 pc ProgramAddrUns 

The field "pc"", shown here as a type ProgramAddrU 
dress, used in program address translation; see §7.1. 



reserved 

disables all traps, lookahead, and the 
instruction counter; allows some mem- 
ory operations to retrv forever: see 
§9.2 

allows loads to be speculative; see §6.4 
prevents unaligned data from raising 
the data-alignment exception: see §6.1 
disables lookahead, so that each mem- 
ory operation finishes before the next 
instruction is issued; see §3.1. 

disables the instruction counter; see 
§10 

floating-point rounding mode; see §5.2 
the program counter 

ns, is actually a structure of type ProgramAd- 



2.2 Branches and Targets 

There are two major families of branch operations. The JUMP familv is intended for general long- 
distance transfers including subroutine calls. The SKIP family adds a small positive offset to the 

2.2 Branches and Targets 
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?-l°^T'?;/r"'!'c^rW"*'°'*''^ "^^^^'i if-then-else situations 

The JLMP and SKIP families have variants for terminating lookahead if the branch is or is not 

taken: see §3.1. 

Jumps are performed in two distinct operations. First, a TARGET operation loads a target register 
with a program address. Second, the JUMP operation is executed. conditionaUv setting the ssw pc 
to the contents of the specified target register. Separating these two concerns lets the processor 
prefetch mstructions down an execution path that may be taken in the future. Loading target 
registers with invalid addresses will not raise an exception unless and until the target register is 
used in a successful JUMP operation. 

There are eight target registers. Each target register contains a program counter; see §2 1 Target 
register TO is reserved for the address of the trap handler. It is automaticaUv exchanged with 
the ssw.pc on a trap, and can be written by unprivileged streams (unless the ''priv tO" bit in the 
program state of the protection domain prohibits it). When a target register is loaded', the program 
cache attempts to prefetch the line containing the new address; see §7 2 
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Chapter 3: Instructions 

Every instruction is 64 bits long, ajid generally contains four fields describing lookaiead, an M- 
operation, an A-operation, and a C-operation. These fields are shown here. 



Bits 


Wd 


Field Name 


Type 


Description 


Operation 










63-61 


3 


la 


Uns 


lookahead 


60-47 


14 


Mop 


Uns 


M-operation 


46-21 


26 


Aop 


Uns 


A-operation 


20-0 


21 


Cop 


Uns 


C-operation 



The lookahead field is used to control M-unit operation overlap and is described in §3.1. In gen- 
eral, an M-unit operation (M-operation) accesses memory in some way, an A-unit operation (A- 
operation) performs arithmetic, and a C-unit operation (C-operation) is primarily responsible for 
control flow. The C-operation can also do some arithmetic operations, exclusive of multiplica- 
tion. Nearly every arithmetic operation that can be done in a C-operation can also be done by an 
A-operation. 

Some operations are encoded by combining multiple operation fields. For example, an MC-operation 
such as INTJ.OADJ)ISP uses both the M- and C-operation fields. STREAM.CREATE and 
STIIEAM.QUIT are MAC-operations. 

The operations in an instruction are decoded in parallel. If any of them is invalid, either because 
it is a privileged operation at the current protection level or it is an illegal operation encoding, a 
privileged operation exception is raised, and no part of the instruction is issued. 

The decoded operations are exe^ Jted in parallel. operands for all operations in the instruction 
are read before any result is written. Results are \^^itten in an implementation-dependent order, so 
if more than two operations in an instruction write to the same destination register, the resulting 
value is undefined. Thus, such an instruction is illegal. Once instruction execution is begun the 
destination registers are always written, regardless of whether or not the operation later raises an 
exception or traps. 

The program counter (Pc) foUows the same rule for reading and wTiting as the operands. The PC 
is read when the instruction is issued and is written when the instruction completes. The written 
value is either an incremented value for normal sequential flow or a new value from a branch. 
The individual operations are described in §11. 

3.1 Lookahead 

The lookahead field is a three-bit unsigned integer that the code generator must guarantee to be 
less than or equal to the minimum number of instructions that the stream might execute before 



3.1 Lookahead 
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encountenng one that depends on the current M-, or MAC-operation. The maximum possible 
lookaheaa value is seven. If there is no such operation, the code generator should set the lookahead 
to the maximum of seven. If the code generator is ignorant of the relevant dependences, the 
ookahead may be set to zero. The lookahead must take into account aU branch paths that are 
lookahead-enabled. as described below. 

An instruction J depends on an M-unit operation (M-, MC-, or MAC-operation) at a prior instruc- 

'° "'^^ °' ^''^"^ ^ °^ ^ of a register implicitlv or explicitlv 

defined by the M-unit operation in /. In addition, an instruction J depends on an M-unit operation 
at a pnor instruction / if the M-unit operation in J references some of the same memorv referenced 
m / and the memory is modified by either or both of / and J. These definitions are manifestations 
01 standard data dependence rules. 

The lookahead field supplies the hardware with an upper bound on the number of additional 
instructions that may begin execution before the current M-unit operation is finished. For example 
If lookahead is zero throughout a program, then the processor wiD finish each M-, MC- or MAC- 
operation before starting the next instruction. Lookahead can be disabled to get the same effect 
by setting the mode bit field ''lookahead.disable" in the ssw. 

Branch paths are determined by branching operations and their corresponding skip amounts or 
target registers. All conditional branch operations have variants that disable lookahead on one of 
the two paths. The "SELDOM" branch operations (JUMP^ELDOM. SKIP^ELDOM) disable 

S°^OFTrvT. JTf l^'.'^.'''' "-OFTEN" branch operations (JUMP.OFTEN, 

bKIP-OFTEN disable lookahead when the transfer is not taken. The effect of disabUng lookahead 
IS to require all outstanding memorj- references to complete before the next instruction is allowed 
to execute. 
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Chapter 4: Condition Codes 



Many operations have alternate versions (with "•-TEST'' appended to the mnemonic) that generate 
a condition code in addition to a value in a register. The eight possible condition code values and 
their default meanings are shown below, where 0, p, and n stand for zero, a positive integer, and a 
negative integer, respectively. 

Name Value Meaning Examples 



CondCodt 



COND^ERO_NC 


0 


Zero, no carry 


0 = 0 + 0 






COND_\EGJ^C 


1 


Negative, no cairy 


n = p + n, n 


= P- 


P 


COND_POS_NC 


2 


Positive, no carry 


P = P + P, P 


= P- 


n 


COND.OVFNANJs^C 


3 


Overflow/NaN, no carry 


n = p + p, n 


= P- 


n 


COND-ZERO.C 


4 


Zero, carry 


0 = n + 0 


= 71 — 


n 


COND_NEG.C 


5 


Negative, carry 


n = n + n, n 


= n - 


■P 


COND_POS_C 


6 


Positive, CcLTry 


p=n + p,j) 


= n — 


n 


COND.OVFNAN.C 


7 


Overflow/NaN, carry 




= n — 


P 



Each newly generated condition code is inserted as cvq at the low end of the four-element condition 
vector cv associated with the stream; the existing codes shift over and the old value of CV3 is lost. 
If multiple operations in the same instruction generate condition codes (because there are multiple 
"-TEST" suffixes), then the condition code from the C-operation is inserted first, followed by the 
condition code from the A-operation. 

After integer arithmetic operations, the condition code describes the sign of the result in the obvious 
way unless overflow has occiirred, in which case the result sign is negative if and only if there was 
no carry. Some integer and bit operations — such as INT JcIAX and BIT JUGHT.ONES — generate 
the carry bit in a nonstandard way; for these operations overflow/NaN is not generated, and the 
condition code still accurately reflects the sign of the result. 

After floating-point operations, the condition code describes the result in a way compatible with 
IEEE Standard 754. See §5 describing floating-point arithmetic. 

A condition mask, shown as cond in the operation descriptions, describes a set of condition code 
values by summing the powers of two corresponding to the codes in the set, typically to determine 
whether a branch should take place. A cond can describe any combination of condition codes. For 
example, the condition mask named IF-EQ (if equal) describes codes 0 and 4, so it has the value 
2° + 2^, which is 0x11 or llig. 

Most of the important condition masks have one or more names. The named condition masks are 
shown below. 



4.0 Lookahead 
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N 



ame 



Value 



After fSUB.TEST x y z) 



CondMask: Manifest 
IFJilW'AYS 

IF_\TVER 

CondMask: Equality 
IF^Q 

IF_ZE 

IF_F 

IF_NE 

IFJfZ 

IF.T 

CondMask: Integer Comparison 
IFJLT 

IFJGE 

IFJGT 

IFJLE 

IFJMI 

IFJPZ 

IFJPL 

IFJMZ 

CondMask: Unsigned Comparison 
IF.ULT 

IF.UGE 

IF.UGT 

IF.ULE 

CondMask: Float Comparison 
IF_FLT 

IFJGE 

IF-FGT 

IF^LE 

CondMask: Other Tests 
IFJOV 

IFJ-UN 

IF.CY 

IF_NC 



0 1 2 3 4 5 alwavs 
6 7 



04 

0 4 

0 4 

12 3 5 6 7 
12 3 5 6 7 
12 3 5 6 7 

1 5 7 
0 2 3 4 6 
236 
0 1457 
135 
024 6 7 
26 7 

0 13 4 5 

1 2 3 
0 4 5 6 7 
5 6 7 

0 12 3 4 

1 5 

0 2 4 6 

2 6 

0 14 5 

3 7 

3 7 

4 5 C 7 
0 123 



never 

y = r (integer, unsigned, float) 
X = 0 (integer, unsigned, float) 

X = 0 (logical) 

y # - (integer, unsigned, float) 
z T^O (integer, unsigned, float) 
X # 0 (logical) 

y < z (integer) 
y> 2 (integer) 
y>z (integer) 
y < 2 (integer) 
X < 0 (integer) 
X > 0 (integer) 
X > 0 (integer) 
X < 0 (integer) 

y < 2 (unsigned) 
y> 2 (unsigned) 
y> 2 (unsigned) 
y < 2 (unsigned) 

y < z (float) 
y>z (float) 
y> 2 (float) 
y<2 (float) 

X overflowed (integer) 
y and 2 are unordered (float) 
carry 
no carry 
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CondMask: Specific Conditions 



IF.0 


0 


7pm n Pk r :> rr\* 


IFJ 


1 


Negative, do carry 


IFJ2 


2 


Positive, no carry 


IFJ 


3 


Overflow/XaX, no carry 


IF.4 


4 


Zero, earn' 


IF.5 


5 


Negative, carry 


IF.6 


6 


Positive, carry 


IF.7 


i 


Overflow/NaN, carry 



4.1 Select Operations 



The SELECT, operations use three-bit encodings to specify one of eight of the most common con- 
dition masks. SELECTJNT uses a mask IntSelect that encodes integer and unsigned comparisons 
as shown below. Additional selects can be realized by reversing the arguments u and v of the 
SELECTJNT operation itself. 



Name 



Value 



After (SUB.TESTxy2) 



IntSelect 



SEL.CY 


0 


carry 


SEL-EQ 


1 


y = r (integer, unsigned, float) 


SELJGT 


2 


y>z (integer) 


SELJGE 


3 


y>2 (integer) 


SEL.UGT 


4 


y> 2 (unsigned) 


SEL.UGE 


5 


y> 2. (unsigned) 


SELJPL 


6 


X > 0 (integer) 


SELJPZ 


7 


x>0 (integer) 



The SELECT_FLOAT operation uses the encoding FloatSelect as shown below. 

Valpe After (FLOAT_MIN_TEST x y z) 



FloatSelect 
SELJLT 
SELJ-LE 
SELJGT 
SEL-FGE 
SELJUN 



2 
3 
4 
5 
6 



y < z (float) 
y <z (float) 
y > z (float) 
y> 2 (float) 

y and r are unordered (float) 



An IntSelect or RoatSelect enumeration describes the same condition code set as the identically 
suffixed CondAlask. 



4.1 Select Operations 



0 



FloatSelect 



I 
I 
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Chapter 5: Floating-point Arithmetic 



5.1 Floating-point Formats 



The IEEE Standard 754 floating-point double basic format (64 bit) and single basic format axe 
supported. This is the structure of a normal 64-bit floating-point number: 



Bits Wd Field Name 



Type 



Description 



Float64 

63 1 sign SignBit 

62-52 11 exponent Uns 

51-0 52 fraction Uns 

This is the structure of a normal 32.bit floating-point number: 

Bits Wd Field Name Type Description 



sign bit 

biased exponent 
fraction 



FloatS2 

31 1 sign 

30-23 8 exponent 

22-0 23 fraction 



SignBit 

Uns 
Uns 



sign bit 

exponent 

fraction 



Doubled preasion addition, subtraction, multipUcation. and conversion operations to and from the 
smg^e precision IEEE 754 format and both signed and unsigned integer formats are supported di- 
rectly. Division and square root are accompUshed with the help of iterative computation primitives 
that use a speaal floating-point format providing extra significand precision: 



Bits Wd Field Name 



Type 



Description 



SpecialFloat64 

63 1 sign 

62-53 10 exponent 

52-0 53 fraction 



SignBit 

Uns 

Uns 



sign bit 

biased exponent 
fraction 



The SpedalFloat64 exponent is biased by 510, so that the true exponent is the biased exponent 
minus olO. 

All operations conform to the applicable IEEE standard. 

Hoating.point comparison operations set the condition code to indicate whether the operands are 
equal, greater, less, or NaN. The carry bit indicates the second operand {vV z) is a NaN. The float - 
invaLd exception is never raised by FLOATuMIN or FLOAT_MAX. When the compare is performed 
By a tLUAT.CMP.TEST, float jnvalid is raised when the operands are unordered. Thus, IIEE 
7o4 tests which do not raise an exception on unordered operands, such as a test for equality, should 
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be implemented using FLOAT-MIN.TEST. Tests for inequalities such as greater than should use 
FLOAT.CMP-TEST to properly handle unordered operands. 

Support for fast doubled precision arithmetic is provided. In doubled precision, a pair of 64-bit 
floating-point numbers is used to hold twice the significant digits and provide at least twice the 
precision of ordinary 64-bit floating point. There are provisions to compute the doubled precision 
sum, difference, and product efficiently. See the doubled precision programming examples in §12.4. 



5.2 Rounding 

Unless explicitly specified otherwise in an operation description, rounding is performed according 
to the rounding mode stored in field "round-mode" in the ssw. The rounding modes are shown 
here. 

Name Value Meaning 



RoundMode 

RND JsTAR 0 round to nearest 

RND-CHOP 1 round toward zero 

RND-FLOOR 2 round toward -oo 

RND.CEIL 3 round toward oo 

Rounding is explicitly specified in some convert operations, such as FLOATXEIL, INT CHOP 
and UNSJ^LOOR. 



5.3 Floating-point Exceptions 

Floating-point exceptions are raised as a side effect of operation completion. The destination 
register of the operation is set in accordance with the IEEE Standard. 

Besides the 64-bit result in the destination register, a floating-point exception records the destina- 
tion register number and a four-bit floating-point result code in the result code register. A non2ero 
result code indicates that the destination register contains an exceptional value and summarizes 
that value. Floating-point result codes are described in §9.1. Note that a zero destination register 
will not allow the exceptional value to be saved. 

In conformance with IEEE Standard 754, an invalid operation exception is raised (and a trap po- 
tentially taken) when a conditional test operation encounters a NaN when performing an inequality 
test as described in §5.2. 

If overflow or underflow traps are disabled, then overflow delivei-s infinity or a maximum magnitude 
floating-point value, depending on the rounding mode, and underflow delivers a denormaiized result. 
When floating-point overflow or underflow traps are enabled, the result in the destination register is 
the same as the masked response, so that the trap handler may report the program state and resume 
execution. Note that underflow is only raised when the result is inexact and subnormal, whether 
the underflow trap is enabled or not. The check for subnormal is before rounding, so the final 
result may actually be normalized (due to rounding). A float-zero^vide exception always returns 
a properly signed infinity. A float jextension exception returns the argument to the operation (they 
are all unary) so that the trap handler may easily locate the value and complete the operation. 
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A floaiJnvalid exception generates a NaN value in accordance with the IEEE standard. A NaN 
generated by an operation describes the reason for the exception using the enumeration below. The 
appropriate code is stored in the low three bits of the fraction. 



Name 



Value 



Meaning 



NaNResultCode 

NAX^ERO-MULJNF 

NANJNF_SUBJNF 

NANJERO_DI\'^ERO 

NANJNFJ)IVJNF 

NA.\_SQRT_NEG 



1 
2 
4 
5 
6 



Zero times infinity 

Magnitude subtraction of infinities 

Zero divided by zero 

Infinity divided by infinity 

Square root of negative number 



The high-order fraction bits of a NaX are zero; these bits are Bits 51-3 for a normal 64-bit floating- 
point number, and Bits 22-3 for a 32-bit floating-point number. The destination register {t or x) 
is stored in the result code register, so that the NaN may be examined by the trap handler for 
diagnosis or continuation. 

There are no signahng NaNs, but data trap bits provide a more comprehensive mechanism; see 
§6.1. 
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Chapter 6: Data Memory 



A Tera system has either two or four data memory units per processor. When four units per 
processor are configured, the additional two units are referred to as "expanded data memory". 
Data is accessed by LOAD, STORE, FETCH J^DD, and STATE operations. This chapter describes 
what is stored in data memory, the semantics of accessing it, the address translation mechanism, 
and finally the internal state of the M functional unit. The M-unit can simultaneously process up 
to eight pending requests for data memory access by each stream. 



6.1 Data Memory Access 



Every data memory cell contains a 64-bit value and a four-bit access state. The value in a memory 
cell can be addressed as a word, 2 haifwords, 4 quarterwords, or 8 bytes. The order of bytes in 
quarterwords, quarterwords in haifwords, and haifwords in words is "big-endian'', i.e. packed so 
that addresses increase as significance decreases. Thus the word at address A, read from left to 
right, most significant bit to least, contains bytes with addresses A, A^l, . ,.A + 7: quarterwords 
with addresses A, ^ + 2, A -f 4, and A + 6; and haifwords with addresses A and >1 + 4. 

The access state modifies the behavior of memory references to the word or partial word contained 
in the cell. It has this structure: 

Bits Wd Field Name Type Description 



AccessState 

3 1 fuU Boolean full/empty bit 

2 1 forward-enable Boolean forpt-ard enable 

1 1 trapljenable Boolean data trap 1 enable 

0 1 trapO jenable Boolean data trap 0 enable 

The operations STATEJLOAD, STATE-STORE, and STATEJ.OCK axe respectively used to load, 
store, and lock the access state. 

Operations always access data memory relative to a pointer. The semantics of memory access are 
determined by an access control field in this pointer, possibly overridden by an access control field 
in the operation, and by the access state of the addressed memory cell(s). Briefly, the access can 
be forced to wait until the cell is either empty or full, a data blocked exception can be raised in 
response to load or store accesses to the cell, and a memory cell can forward accesses to another 
memory cell. 

A pointer has two parts: an access control part, which modifies access through the pointer, and an 
address part. The fields in a pointer are as follows: 
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Bits Wd Field Name Type Description 



Pointer: access control 

63 1 0 reserved 

62 1 fwd-disable Boolean forwarding disable 

61-60 2 fccontrol FullEmptyControl full/empty control 

59 1 trapl^tore.- Boolean data trap 1 disable on store 
disable 

58 1 traplJoad-disable Boolean data trap 1 disable on load 

57 1 trapO-store.- Boolean data trap 0 disable on store 
disable 

56 1 trapOJoad_disable Boolean data trap 0 disable on load 

55-48 8 0 reserved 



Pointer: address 

47-0 48 address DataAddrUns data memory address 

The field ''address'', shown here as a type DataAddrUns, is actually a structure of type DataAd- 
dress. used in data memory address translation; see §6.2. 

The value in the field '^e.contror is of type FullEmptyControl, described here. In the description 
of the load and store behaviors, the term ''waits for empty (full)'' means that the operation waits 
until the field '^ull" in the memory cell's full bit becomes false (true); the term "sets empty (full)" 
means that the field '^ull" is set to false (true). 

Name Value Behavior 



FullEmptyControl 

FE_XORMAL 0 LOAD loads; STORE stores and sets 

full 

FE-FUTURE 2 LOAD waits for full, then loads: 

STORE waits for full, then stores 

FE^YNC 3 LOAD waits for full, then loads and 

sets empty; STORE wadts for empty, 
then stores and sets full 

Some memory operations encode an access control operand, abbreviated ac, that supersedes the 
pointer's access control specification. The operation access control structure is shown here: 

Bits Wd Field Name Type Description 

OperationAccessControl 



4 1 fwd-disable Boolean forwarding disable 

3-2 2 fe.control FullEmptyControl full/empty control 

1 1 trapl-disable Boolean data trap 1 disable 

0 1 trapO-disable Boolean data trap 0 disable 



Memory reference operations first add the address field from a pointer held in register 5 to an 
optional scaled offset. Addition is done modulo 2"^^, The offset is derived from either another register 
y or from an unsigned literal displacement disp in the instruction and is then scaled (multiplied) 
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by the size in bytes of the addressed object. The length of the disp field varies so that the scaled 
offset covers the same set of meniory locations independent of object size. This sum is the effective 
address of a word; halfword. quarierword, or byte in memory. 

Then, the effective address is checked against the map limit for this domain. If the limit is exceeded, 
a data map limit exception is raised. 

Unless the field "unaligned.data.enable" is set in the ssw, a data alignment exception will be 
raised when an effective address presented to memory is not a multiple of the number of bytes in 
the addressed object. 

At this point, the data map entry is consulted. If the current privilege level of this stream is 
insufficient for the map's protection level for this t}-pe of operation, a data protection level exception 
is raised. Otherwise, the segment offset in the effective address is checked against the segment limit 
in the map entry. If the limit is exceeded, a data segment limit exception is raised. 

Xext, a data blocked exception is raised if a data trap bit is enabled in the addressed word and the 
corresponding data trap disable bit is dear in ac or in s when oc is not present. 

Forwarding is examined and handled next. If ac is present in this operation, its forward disable bit 
is used; otherwise that of s is used. If the selected forwarding disable bit is dear and the forward 
bit is enabled in the addressed word, then the cell may contain a forwarding pointer rather than the 
data itself. If the cell is forwarded and empty then the operation is retried later; the interpretation 
is that the forwarding pointer is locked. If the cell is full then the value in the cell is used as an 
effective word address for another memory access. No registers are modified in this process. The 
relative word position of a partial word access is unchanged; the three least significant bits of the 
forwarded effective address are copied from the original address. Data traps at forwarded locations 
are processed as usual; the data trap disable bits in effect are the original data trap disable bits. 
The forwarding disable bit in effect is the original (clear) forwarding disable bit. The full/empty 
control bits are taken without modification from ac or from s when ac is not present. 

Finally, synchronization is handled. The full bit in the addressed word is processed in conjunction 
with the full/empty control bits from ac, or $ when ac is not present. 

No memorj' full bit testing occurs if full/empty control is FE_NORMAL; in this case load operations 
fetch the value of the addressed word or partial word into register r, and store operations store the 
contents of r into the addressed word or partial word and set it full. 

K full/empty control is FEJUTURE or FE^YNC, then the memory full bit is tested. If its state 
is the one waited for, then the load or store of the value occun, and the memorv- full bit is chaaged 
if full/empty control was FE-SYNC. Otherwise, the operation is retried later. " 

When the operation is retried it starts over with the original address (before any forwarding). If 
the total number of memory ceU accesses due to forwarding and retrying exceeds "the retry limit in 
the data state descriptor, a data blocked exception is raised and the operation is aborted. Retries 
may also be caused by network contention, translation stalls, and other miscellaneous hardware ' 
events. When ssw^verride mode is set, aU memor}' operations except synchronizing loads, stores, 
and int Jetchj.dds will retry forever and will not raise the data blocked exception. 
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6.2 Data Memory Address Translation 



Data memory addresses are found in Bits 47-0 of pointers. A data memory address has tliis 
structure: 



Bits Wd Field Name Type Description 

DataAddress 

47-28 20 data-segment.- DataSegment data segment number 
number 

27-13 15 data_segment.- SegmentOffset data segment offset 
offset 

12-3 10 data_framej)ffset Uns data frame offset 

2-0 3 byte^ffset Uns bvte offset 



A complete data memory address is 48 bits long, potentially addressing 256 Terabytes of memorv. 
However, only 42 bits of the address are currently implemented, and the high-order six bits of 
the data segment number must be zero. The implemented data address space is consequently 4 
Terabytes. This space is partitioned into 16K segments, each of which can vary in size from 8 
Kbyte to 256 Mbyte in 8 Kbyte increments. 

Data memory address translation proceeds in five logical steps. The translation logic block diagram 
IS shown m Figure 6.1. First, the data segment number is validated. Second, the data segment 
inap IS accessed, yielding a data segment map entry. Third, the protection level is checked and 
the segment address is limited and relocated using values in the map entry. This yields a logical 
address in two parts, a logical unit number and logical unit offset. Fourth, the data frame offset is 
transformed so that memory references are scrambled, yielding the logical frame offset. Fifth, the 
logical address is distributed to spread references among the logical, units (the memory resources). 
These steps are now described in more detail. 

The first logical step is to validate the data segment number and select the data map to use for 
translation. If the protection domain's data map limit from the protection domain's data state 
descnptor is less than the data segment number, a data map Umit exception is raised. Otherwise, 
the resulting segment number and domain number are sent to the data segment map. At this point, 
the alignment requirements for the selected operation are checked against the effective address. If 
the reference is unaligned and field "unaligned-data^nable" of the ssw is clear, the data alignment 
exception is raised. 

The second step in the translation reads a data map entry from the data segment map. Each entrv 
has the structure shown here: 
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Figure 6.1: Data Mapping Logic Block Diagram 
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Bits Wd Field Name 



Type 



Descriotion 



DataMapEntry 



63-62 


2 


storeJevel 


Level 


minimum store protection level 


61-60 


2 




Level 


minimum load protection level 




o 
o 


U 




reserved 


56 


1 


Stall 


Booleam 


stall reierences to tins entry 




1 


locked 


Doolean 


lock this map entry into TLB 


54 


1 


djstnbution-- 
enable 


Boolean 


distribution 


53-52 


2 




AcSQurce 


select data memory, expanded data 
memory, or 10 P units 


51-48 


4 


0 




reserved 


47-40 


8 


logical-unit 


Uns 


logical unit number 


39 


1 


0 




reserved 


38-24 


15 


segmentJimit 


SegmentOlFset 


segment limit 


23-19 


5 


0 




reserved 


18-0 


19 


segment-base 


DataFrame 


segment base 



This map IS stored in local data memory, starting at the data map base for the given domain (see 
§8.4). To speed translation, a translation lookaside buffer (TLB) caches the map entries. Coherency 
IS maintained by flushing modified entries from the cache using the DATA3IAPJLUSH operation 
The desired entries to flush are specified with a domain data address, which combines the domain 
and data address to index the data map. Entries with field "locked^ set will onlv be evicted from the 
cache by a DAT.OIAP JLUSH operation which matches the entr>'. The implemented data map 
cache contains ol2 entries, with four-way associativity. Within each set, entries are replaced using a 
least-recently.used policy. To reduce contention between domains, the domain number times eight 
IS exclusive-or ed with the set index before addressing the TLB. The DATAJ^AP JLUSH \NY 
operation treats the TLB as direct mapped, using the low two bits of the tag as the set index, using 
a domain data TLB address. To flush all entries for a domain from the cache, each entry must 
be accessed. The flush addresses must sequence through all possible values of the set Jndex and 
set_number. The domain data address and domain data TLB address structures are shown below: 



Bits Wd Field Name 



Type 



Description 



DomainDataAddress 
63-60 4 domain 

59-42 18 00000 
41-35 7 tag 

34-28 7 set-number 
27-0 28 segment-offset 



Uns 



Uns 

Uns 
Uns 



the domain to which this data address 
pertains 

r'lserved 
data TLB tag 
data TLB set number 
untranslated bits 



Domain Data .Address 
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Type 



Description 



29 



DomainDaia TLBAddress 



63-60 


4 


domain 


Uns 


the domain to which this data address 
pertains . 


59-42 


18 


00000 




reserved 


41-37 


5 


tag 


Uns 


data TLB tag 


36-35 


2 


setJndex 


Uns 


data TLB set index 


34-28 


1 


set umber 


Uns 


data TLB set number 


27-0 


28 


segment-offset 


Uns 


untranslated bits 



The third translation step limits and relocates the segmented address. If the current privilege level 
of the stream storing (loading) data in this segment is less than the minimum store (load) protection 
level in field "store JeveP (field "loadJevel") of the data map entry, then a data protection level 
exception is raised. Note that many load operations will need store privilege to properly update 
the access state. However, store operations are implicitly given load privilege to properly follow 
access control wailing or trapping. That is, the store protection level is assumed to be no iiigher 
than the load protection level. 

The segment limit, field ^^segmentJimit", is compared with Bits 27-13 of the data address. If the 
segment limit is smaller, then a data segment liniit exception is raised. 

If the field "^stall" is set, then the operation is returned to the reiry pool and tried again later. 
This forced retry allows the supervisor to perform some memory management operations without 
stopping all activity in the domain. Note that the PROBE operation is not affected by field "stall", 
as its result is determined by the earlier segment limit check. 

Otherwise, Bits 27-13 of the data address, extended with zeros on the left to 19 bits, are added 
to the segment base field "segment.base" in the data map entry. The sum is sent to the address 
scrambler as the logical segment offset. 

The fourth step scrambles the Bits 21-3 of the data address, producing the logical frame offset. 
The concatenation of the logical segment offset and the data frame offset is treated as a 29-element 
vector in GF(2)^^. (GF(2) is the field with elements 0 aaid 1, and as its multiply operation, and 
exdusive-or as its addition operation; GT{2)^^ is the vector space of dimension 29 over this field.) 
The vector is scrambled by multiplying it by the unit scrambling matrix, a fixed 29-by-19 bit matrix 
whose low^order 19-by-19 bit submatrix is invertible. This multiply yields the logical frame offset. 

The scrambling matrix is chosen to make any sequence of constant-stride addresses spanning a 
length 5 < 2^® generate a nearly uniform distribution in the logical frame offset, which in turn 
generates a nearly uniform distribution in the physical unit number field and the low-order bank 
bits of the unit offset. Appendix C.l specifies the matrix and the inverse of the low-order 19-by.l9 

submatrix. 

The distributor takes the concatenation of the logical unit number (field "logical.mut" from the 
data map entry), the logical segment offset, and the logical frame offset as its logical address. The 
next step distributes this logical address to control physical locaUty of reference. The distributor 
transforms the logical address into a physical address consisting of an eight-bit physical unit number 
and a 29-bit physical unit offset. 

The distribution bit (field "distribution.enable") in the data map entry allows references to be 
spread over all p memories in the system, rather than staying within one memory unit. Here, p is 
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set via the scan system and usually matches the number of processors, making p a power of two. 
However, in the presence of a faulty memory, p may also be a power of two less one. Distribution 
divides the logicaJ unit address by p (or p -i- 1 with a faulty memory) to effectively right shift 
the low-order bits of the logical unit number into the high-order bit positions of the logicaJ unit 
offset and replace them with the low-order bits from the logical frame offset. Thus, the remainder 
becomes the new unit number and the quotient the unit offset. With a faulty memory, hardware 
mapping will allow distribution to bypass a faulty resource. This mapping may be set differently for 
each resource class, so that a system can run with any one faulty normal data memory resource and 
any one faulty expanded data memory resource. Even when distribution is disabled.' this mapping 
will be in effect, so that the logical unit space appears contiguous. 

This scheme allows distribution across physical memory units under control of the data map entrj'. 
A data map entry with distribution enabled will address all usable physical memory units in the 
system, implying that only words are addressed in each unit as (word) addresses increase 
from 0 to 2^^ - 1. Moreover, the logical unit number u appearing in a data map entry is required 
to be less than p when distribution is enabled. If u is too large, a data address unimplemented 
exception is raised. 

The low-order three bits of the original address are the byte address of the datum within the 
addressed word and are copied without modification into the low-order three bits of the final unit 
offset. 

The Tera MTA computer supports machine subsetting. This feature allows any power of two subset 
of a machine to appear to software as an independent machine. For example, a 16-processor machine 
could be split into two eight-processor machines. In such a case, the interconnection network need 
not be split, but can be shared. To support subsetting, a physical unit base register is set up to 
convert unit numbers from the 0 to p - 1 range to the appropriate range in the actual machine. 

The resulting address is then sent to the network for routing to a memory unit. The physical unit 
number and the field "memory .type" are used to construct the network address, which controls 
network routing. The memory type should be selected from the values in the following enumeration: 



Name Value Behavior 

Resource 

RES J)MEM 0 Expanded data memory 

RES JOM 1 Normal data memory 

I^SJOP 2 I/O processor 

The hardware supports an option which combines the normal and expanded data memory for global 
distribution. When that option is enabled, RES_DMEM selects the bottom 1 gigabyte per processor 
of the global memory pool and RES JOM selects the top 1 gigabyte. 

If the physical address is unimplemented, then a data protection exception is raised when the issuing 
operation completes. If the memor>- system detects an uncorrectable error such as a double-bit ECC 
error on the data loaded from memory, then a data hardware error exception is raised when the 
issuing operation completes. To help detect hot spots, successful loads which take an excessive 
amount of time to travel from the processor to the memory will raise a latency limit exception 
while performing the load. Synchronizing loads which retry are not subject to the limit until they 
succeed. This limit is set during system initialization. This implementation checks the limit with 
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a granularity of 16 cycles. The compares are performed modulo 4096 cycles, so that a latency of 
4112 would appear as a latency of 16. 

6.3 M-unit Internal State 

The M-unit processes memory requests that are generated by M-operations. The M-unit may have 
up to eight requests simultaneously pending for each stream. The -sl-unit completes each request 
asynchronously. 

For each stream, the state of any failed M-operations is held in eight pairs of registers called the 
data control registers and data value registers. A trap handler can save these register pairs using 
the DATA-OP.A-SAVE and DAT.A-OPD^AVE operations and can later use them to retrj' the 
operation with DATA-OP JIEDO. The values in these registers are now described in more detail. 

The eight data control registers contain address and control information for up to eight memory 
reference operations in progress in the M-unit, due to lookahead. The M-unit writes one of these 
registers when a memory operation is initiated, and reads it as it (re)tries the reference. When 
no operations are in progress, the program may read them directly using the DATA.OPA^AVE 
operation. Each of the data control registers contains a data control descriptor with the structure 
shown here. 

Bits Wd Field Name Type Description 



DataControlDescriptor 



63 


1 


0 




reserved 


62 


1 


fwd-disable 


Boolean 


forwarding disable 


61-60 


2 


fejcontrol 


FullEmptyControl 


full/empty control 


59 


1 


trapLdisable 


Boolean 


data trap 1 disable 


58 


1 


trapO-disable 


Boolean 


data trap 0 disable 


57-53 


5 


destj-eg 


Reg 


destination or source register 


52-48 


5 


restop 


iletryOpCode 


rest of the operation code 


47-0 


48 


address 


DataAddrUns 


byte address 



The value in the field '^restop" encodes the operation that failed and raised an exception. The high- 
order bit of the field "restop" is set if the operation was a load (more precisely, an operation that 
writes a register upon completion) and is cleared if the operation was a store. The Retry OpCode 
enumeration is shown here. 



Name Value Meaning 



RetryOpCode: Stores 






OPA_STOREB 


0 


STOREB 


OPA^TOREQ 


1 


STOREQ 


OPA^TOREH 


2 


STORXH 


OPAJSTORE 


3 


STORE 


OPA_STATILSTORE 


T 


STATE-STORE 


OPA^TORX-EMPTY 


11 


REG.STORE 
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RetryOpCode: Loads 



OP.-UXT_LOADB 


16 


INT-LOADB 


OP.UXTJ-OADQ 


17 


INTJLOADQ 


OP.UNTJLOADH 


18 


INT_LOADH 


OP.A_IXT_FETCH_A.DD 


19 


INT_FETCH_A.DD 


OPA.UNSJ-OADB 


20 


UNS-LOADB 


OP.\.UNSJ.OADQ 


21 


UNS_LOADQ 


OPA.UNSJ-OADH 


22 


UNSJ-OADH 


0P.1_L0AD 


23 


LOAD 


OP.\_STATEJ.OAD 


24 


STATE_LOAD 


OP.\_STATE_LOCK 


25 


STATE-LOCK 


OP.A_PROBE 


26 


PROBE 


OP.\JlEGJ-OAD 


27 


REG-LOAD 


OP-AJCRUB_LOAD 


29 


SCRUB_LOAD 


RetryOpCode: Internal Codes 






OPA-STREAM.CREATE 


4 


STREAM CREA.TE 


OPA_MAP_FLUSH 


5 


DAT.A^AP-FLUSH 


OPA_MAP_FLUSH-\NY 


6 


DATA3IAPJ'LUSH_A.NY 


OPA-DATA^TATE.- 


9 


DATA_STATE_RESTORE 


RESTORE 






OP--LSTREAM.CATCH 


12 


STREAM.CATCH 


OPA_DAT.A_OPDJAVE 


14 


DATA-OPD^AVE 


OPA_DATA.OPA_SAVE 


15 


DATA-OPA^AVE 



The eight data value registers contain the data (if any) that the M-unit was attempting to write 
using the memory operation in the corresponding data control register. These registers are explicitly 
read by the program via the DAT.l-OPD^AVE operation. 

6.4 Speculative Loads 

In speculative load mode, some data memory exceptions are deferred until the loaded value is 
used. In the usual circumstance, exceptional values are never used because the program (correctly) 
fails to use the prefetched data. Speculative loads allow data prefetching in iterative or recursive 
computations with data-dependent exit conditions. 

When speculative loads are enabled (fteld "specJoad.enable'' in ssw), a load with access control 
FE JCORMAL into register r that would otherwise raise a data alignment exception, a data segment 
limit exception, a data map limit exception, or a data protection level exception wiU instead 
place a data control descriptor (§6.3) in r and set the corresponding poison flag in the exception 
register(§9.1). Note that the field '^destj-eg'^ of the data control descriptor is redundant. All other 
exceptions, including a data memory retry exception, are raised whether speculative loads are 
enabled or not. Whenever r is used as a destination register (even by a successful load, speculative 
or not) its poison flag is cleared. Use of a poisoned register r as a source operand raises a poison 
exception, except in REG-STORE, REG-MOVE, SELECT, and TRAP JIESTORE operations. 
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Chapter 7: Program Memory 



A processor accesses instructions held in a program memory region of the data memory local to 
the processor. The term ''program memory" is used to refer collectively to this region in data 
memory. This chapter describes what is stored in program memory, the semantics of accessing it, 
the address translation mechanism, and the instruction cache. Since it is part of data memory, 
each ceU of program memory contains a four-bit access state and a word value, but this access state 
is ignored by the instruction fetching process. The value is a 64-bit instruction specifying up to 
three operations, one for each of the and C-units. 



7.1 Program Memory Address Translation 

Program addresses are 32 bits wide and are found in field ''pc" occupying the low-order 32 bits 
of the stream status word and in the target registers. Only 25 bits of the program address space 
are implemented; the most significant seven bits must be zero. Thus the physical address space 
of the implementation allows for up to four gigawords of physical program memory but only 32 
megawords are currently implemented. Program addresses are word rather than byte addresses 
and always address the data memory unit attached to the processor. A program address has the 
structure shown here: 

Bits Wd Field Name T\T)e Description 



PrvgrxLTnAddrcss 

31-12 20 prog-p age-number PageN umber program page number 

11-0 12 prog-page-offset Uns program page offset 

Program memory address translation for all streams (regardless of level) uses the translation logic 
shown in Figure 7.1. Address translation proceeds in four steps. The first logical step is to 
limit the program page number and select the program map to use for translation. Second, the 
appropriate program page map for this page is accessed, yielding a program page map entry which 
is concatenated with the program page offset to form a logical unit offset. Finally, the logical unit 
offset is scrambled, resulting in a physical unit offset. 

If the protection domain's program map limit (found in the protection domain's program state 
descriptor) is smaller than the program page number, then a program protection exception is 
raised. The selected map base is added to the page number to yield a unit offset into local program 
memor>', where the page map entrj^ is found. 

The program map entry from the program page map has the structure shov^n below. 



7.1 Program Memory Address Translation 
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level 



procecQon 
domain 



program 
state 



program 
map limit 



program 
map base 



compa 



program map 
limit excepbon 



program 
page number 



adder 



absolute page 
number 



program 
page o^set 



program map 



execute 
level 



comparc< 



program 

protection 

exception 



frame number 



bank 

scrambler 



physical 
unit o^set 



Figure 7.1: Program Mapping Logic Block Diagram 
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Bits \Vd Field Name 



Type 



Description 



Progra mMapEn t ry 
31-29 3 0 

28-12 17 progJraine ProgFrame 

11-2 10 0 

1-0 2 execJevel Level 



reserved 
frame number 
reserved 
execute level 



The implementation restricts the field '^prog Jrame'' to values from 0 to 2^^ -2. To reduce instruction 
fetch latency, a program map cache saves recently used map entries. This cache is kept coherent 
with program memory using explicit PROGRAM^AP_FLUSH instructions. The desired entries 
to flush are specified with a domain program address, which combines the domain and program 
address to index the program map. This implementation provides a 128-entry program map cache 
backed by the L2 instruction cache (i.e., the L2 instruction cache holds program map entries as 
well as instructions). The program map cache is not associative; however, a small fully associative 
victim cache provides some tolerance for contention. To reduce contention between domains, the 
domain number times eight is exclusive-or'ed with the set index before addressing the TLB. 



Bits Wd Field Name 



Type 



Description 



DomainProgramAddress 
63-60 4 domain 

60-33 28 0000000 
31-0 32 address 



Uns the domain to which this data address 

pertains 
reserved 

ProgramAddrUns program memor\' address 



When accessing the program map cache, the program address is equated to the ProgTlbAddr 
structure below. To flush the mappings for a domain from the TLB, the flush addresses must 
sequence through all possible values of the field "set .number". 



Bits Wd Field Name 



Type 



Description 



ProgTlbAddr 

31-25 7 00 

24-19 6 tag Uns 

18-12 7 setjiumber Uns 

11-0 12 framejoffset Uns 



reserved 

program TLB tag 
program TLB set number 
untrajcislated bits 



Smce the TLB is backed by the L2 instruction cache, map entries must be flushed from L2 as well. 
The structure below is used for addressing map entries in the L2 cache. The PROGRAIOIAP - 
FLUSH instruction automatically forwards a flush to the L2 cache, so that the sub-block of 64 mai 
entnes containing the referenced entry is flushed. The PROGRAMJ^AP JLUSHJINY operation 
flushes the whole Une containing the referenced entry. As before, flush addresses must sequence 
through all possible values of the field "set-number" in L2 as well. 



7.1 Program Memory Address Translation 
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Bits Wd Field Name Type Description 
Prog L 2 Address 

31-25 7 00 reserved 

24-21 4 set_nnmber Uns set number for L2 cache 

20-18 3 lineJndex Uns line index 

17-12 6 subblockJndex Uns subblock index 

11-0 12 frame^offset Uns untranslated bits 

When a stream attempts to issue an instruction, if the privilege level of the stream is not equal 
to the execution protection level field '^execJevel" in the corresponding program page map entry, 
then a program protection exception is raised. 

The program map yields a 17-bit frame number from field "progjrame" which is concatenated to 
the low.order 12 bits of the program counter, forming a 29-bit logical unit offset. 

The bits of this offset are scrambled exactly as for the data logical unit offset. 

The resulting 29-bit logical unit offset is also the physical offset; there is no distributor in the 
program address translator. The logical unit offset is then sent to the attached data memory unit. 
If the address is unimplemented, then a program protection exception is raised when an attempt 
is made to issue an instruction which could not be fetched due to this exception. If the memory 
system detects an uncorrectable error (such 33 a double-bit ECC error on the data being retrieved 
from memory), then an uncorrectable program memory exception is raised. 

Note that identical data and program logical offsets in the same data memory will address identical 
locations as long as the segment's data map entry field "distribution-enable'' is clear. 



7.2 The Instruction Cache 



There is a primary and secondary instruction cache for each processor to reduce the required 
program memory bandwidth and improve latency. The caches are non-blocking, so that other 
streams may access the caches while a miss is being handled. 

The primary cache (Ll) holds 1024 instructions, organized in lines of four words. The 256 lines 
are two-way associative, so there are 128 sets. Lines are replaced using a least-recently-used poUcy. 
Both the primary and secondary caches are tagged with physical addresses. Physical "addresses are 
-mapped to the Ll cache as shown below. 



Bits Wd Field Name 



Type 



Description 



LlAddress 
31-26 
25-9 
8-2 
1-0 



00 
tag 

set -number 
lineundex 



Uns 
Uns 
Uns 



reserved 
physical Li tag 
Ll set number 
Ll line index 



The secondary cache (L2) holds one quarter million words of instruction and program map data. To 
reduce the number of tags, the data is organized into lines of 256 words, with eight sub-lines of 32 
words each. The 1024 lines are four-way associative, so there are 256 sets. Here, lines are replaced 



L2 Address 
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using a random policy. Note that 16 lines can contain a program frame. Physical addresses are 
mapped to the L2 cache as shown below. 

Bits Wd Field Name Type Description 



L2 Address 










31-26 


6 


00 




reserved 


25-16 


10 


tag 


Uns 


physical L2 tag 


15-8 


8 


set-number 


Uns 


L2 set number 


7-5 


3 


lineindex 


Uns 


L2 line index 


4-0 


5 


subblockundex 


Uns 


L2 subblock index 



The PROGRAM.CACHE-FLUSH operations are provided so that the operating system can main- 
tain cache coherence when instructions in program memory are changed. These operations allow 
program frames to be flushed from the caches, so that subsequent accesses will fetch correct data 
from program memory. 

To flush all entries from a single frame from the Ll and L2 caches, each entry must be flushed with 
PROGRAM-CACHE-FLUSH. The address must sequence through all values of the Ll set number, 
and all values of the L2 set number within that page, amounting to 128 flushes. To flush all entries 
from the Ll and L2 caches, each entry must be flushed ^nih PROGRAM.CACHE-FLUSH-ANY. 
The address must sequence through all values of the Ll set number, and all values of the L2 set 
number, comprising a total of 256 flushes. PROGRAMXACHE JLUSH J-1 has the same effect on 
the Ll cache as PROGRAMXACHE JLUSH^NY, without affecting the L2 cache. 
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Chapter 8: Levels and Protection Domains 



8.1 Levels 

A stream can execute at one of four privilege levels: LEV.USER. LEV^UPER (supervisor), LEV.- 
KERNEL, and LEVJPL (initiaJ program load). Lower levels have fewer privileges. The privilege 
levels are defined here: 

Name Value Meaning 



user level 
supervisor level 
kernel level 

initial program load level 

User, supervisor, kernel, and IPL level streams are constrained in addressability by the' program 
and data maps. The data map entries define the minimum privilege levels needed to read and to 
write each segment, and the program map entries define the exact privilege level needed to execute 
from each page. 

The LEVEL J:NTER and LEVELJITN operations change stream privilege levels. A LEVEL.- 
ENTER must be the first operation executed at an entry point when the caller is from a different 
privilege level. LEVELJITN restores the original privilege level. The current privilege level is 
expressly not directly readable by a stream (although it can be inferred from the program map) to 
simplify the virtualization of privilege levels. 

The domain signal exception is set when the privilege level of the issuing stream is less than the 
domain signal level in the program state. The domain signal level is increased by the operating 
system when it finds it necessary to communicate with all streams in its domain, e.g. to prepare 
for a swap. 



Level 

LEV.USER 0 

LEV^UPER 1 

LEVJCERNEL 2 

LEVJPL 3 



8.2 Protection Domains 

A processor supports 16 protection domains, each of which implements an address space. Each 
domain has several registers holding stream resource limits and accounting information. By con- 
vention, one of the protection domains is reserved for operating system daemons. 

A stream runs in exactly one protection domain, denoted by D. When one stream activates another 
using the STREAM-CREATE operation, the new stream executes in the same protection domain 
as its creator and therefore inherits all of its creator's job-context. A stream's protection domain 
D is read by the privileged DOMAIN JDENTIFIER_SAVE operation and written bv the privileged 
DOMALNJ,EAVE operation. 

Level 



I 
I 
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Each protection domain has counters controlling stream resource allocation, a data state descrip- 
tor and a program state descriptor describing the data and program address spaces, and eight 
performance counters. 



8-3 Stream Resource Control 



The seven-bit counter SRESp contains the total number of streams reserved in the protection domain 
by STREAM-RESERVE operations. The seven-bit counter scvKo maintains a cotmt of the actual 
number of streams in use. It is constrained by the hardware to be less than or equal to sresd, 
is incremented by the STREAM-CREATE operation, and is decremented by the STREAM.QUIT 
operation. These counters may also be read by the STREAM-CUR-SAVE and STREAM-RES-- 
SAVE operations. 

The seven-bit counter SLlMp, found in the program state descriptor, contains the maximum num- 
ber of streams reservable by this protection domain, slim^ is an upper bound on sresd, the 
streams currently reserved the protection domain. The operating system sets SlllAo to prevent the 
protection domain from monopolizing the available streams. 

SLIM/) can actually be set below the current vaiue of sresd; because STREAM-QUIT actually 
decrements SRESp as well as scuR/j, both will be coerced lower as streams terminate. 



8.4 Data State Descriptor 



There is one data state descriptor per protection domain specifying how the data memory operations 
are interpreted. It is written using the DATA-STATE-RESTORE operation. The descriptor is 
shown here. Note that the field '"retryJimit" is multiplied by four in use. This factor allows retry 
limits up to 1024. As with memory addresses, the data map limit must have zeros for the six most 
significant bits in the current implementation. 



8.4 Data State Descriptor 
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Bits Wd Field Name 



TvDe 



Description 



DataStateDescriptor 
63-60 4 domain 

59-58 2 min.dkill 



Uns 
Level 



57-48 10 0 
47-28 20 limit 



27-20 8 retryJimit 



DataSegment 



Uns 



19 1 0 
18-0 19 base 



DataFrame 



the domain to which this descriptor 
pertains 

data minimum level not killed; if a 
memory operation is selected for issue, 
and its stream's privilege level is be- 
low field "min.dkill'', then it fails with 
result DR.UXIMPLEMEXTED«OP, 
raising data.prot 
reserved 

data map limit: the largest data seg- 
ment number available to the domain; 
see §6.2 

data memory retry limit: bounds the 
number of times that a memory oper- 
ation can be retried before failing and 
raising the data memory retry excep- 
tion; see §6.1 
reserved 

data map base, added to data segment 
numbers to yield an offset into local 
data memory: see §6.2 



8.5 Program State Descriptor 



There is one program state descriptor per protection domain. It contains sweral kinds of informa- 
tion relating to instruction interpretation within the domain and is written using the PROGRAM-- 
STATE-RESTORE operation. This descriptor is defined below: 



Program State Descriptor 



Levels and Protection Don. ..us 

Bits Wd Field Name Type 



Description 



ProgrcmStateDescriptor 

63-60 4 domain Uns 

59-58 2 min«pkill Level 



57-56 2 nain-psleep Level 



the domain to which this descriptor 
pertains 

program minimum level not killed: if 
the privilege level of an issued stream 
is less than the value in field "min.- 
pkiU"", then the stream branches to a 
virtual address set by the scan system, 
presumably to execute a STREAM.- 
QUIT; after branching, the stream has 
all traps masked and cannot be pkill'd 
again 

program minimum level not sleep- 
ing; if the privilege level of an issued 
stream is less than the value in field 
"min-psleep", then all side effects of 
the instruction are suppressed includ- 
ing counter increments; if both min.- 
pkill and min.psleep are set, psleep 
has precedence over pkill 



ProgramStateDescriptor 
55 1 0 
54 1 priv.tO 



53 



1 piiv.quit 



Boolean 



Boolean 



52-50 3 0 
4&-48 2 allsig 



Level 



reserved 

writing target register TO is privileged, 
so that unprivileged streams may not 
freely change the address of the trap 
handler; see §9.2 

iSTREAM-QUIT operations become 
privileged, to provide an opportunity 
to clear the stream's registers before 
releasing it to the hardware for reallo- 
cation; see §2 
reserved 

minimum level not signaled; if a 
stream is selected for issue and its 
privilege level is less than the value 
in field "allsig", then the stream will 
raise the domain signal exception 
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ProgramStateDescriptor 
47 1 0 

46-40 7 slim 



36- 38 2 0 

37- 18 20 Umit 



17 1 0 
16-0 17 base 



Uns 



PageNumber 



ProgFrame 



reserved 

stream limit, slim£), which limits the 
maximum number of streams that 
may be reserved for this protection do- 
main, see §8.3 
reserved 

program map limit, the largest pro- 
gram page number available to the do- 
main: see §7.1 
reserved 

program map base, added to program 
page numbers to yield an offset into 
local data memory; see §7.1 
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Chapter 9: Exceptions and Traps 



An exception is an unexpected condition raised by an event in the user program, the operating 
system, or the hardware. The exception register summarizes exceptional conditions and the result 
code register describes floating-point and memory exceptions more fully. 

Exceptions can cause a trap to be triggered the next time the stream is ready for execution. 
However, a set exception flag not trigger a trap if that trap has been disabled bv one of the 
trap-disable bits in the trap mask of the ssw. If an exception is raised while its trap is disabled 
and the trap is later enabled, the trap wiU be taken then. Once raised, an exception flag remains 
set until explicitly cleared by software. 

Multiple exceptions can occur simultaneously For example, if a stream uses lookahead to issue two 
concurrent loads, the two loads can finish together (between instructions of the issuing stream) 
Suppose one load raises a data trap 0 exception and the other load raises a data trap 1 exception 
It IS up to the program (usually a trap handler) to decide the order in which such exceptions are 
processed. 



9,1 Exceptions 



The exception register is manipulated using the privileged EXCEPTION-SAVE and EXCEPTION. 
RESTORE operations. An enumeration for the exceptions is shown below. 



Name 



Value 



Meaning 



Exception: Hardware Exceptions 
Ex-Data-HWXrror 



ExJ>rogJEW_E; 



rror 



Exception: System Exceptions 
ExJnstruction.Count 
ExJ)ataJrot 

Ex_ProgJ^rot 
Ex-Poison 

Exception: Signal Exceptions 
EjuDomain^ignal 



57 



56 



52 
51 
50 
49 



48 



data memory error or network hard- 
ware error 

uncorrectable program memor}' error 



instruction count became 0 
data protection 
program protection 
use of a poisoned register 



domain signal 
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Exception: User Exceptions 
Ex.Create 

E>L_Privileged 

Ex-D a t a-.AJi gnm en t 

ExJData-Blocked 



41 



43 
42 



44 



stream create exception 

privileged operation 

unaligned data exception 

data memory retry exception or data 
trap 

float software extension exception 



Ex«Floatj;xtension 
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Exception: Floating-point Exceptions 
ExJ"loatJn valid 

Ex_FloatJeroJDivide 
ExJTloat .Overflow 
Ex-Float-Underflow 
Ex-FloatJnexact 



36 
35 
34 
33 
32 



float invalid operation 
float zero divide 
float overflow 
float underflow 
float inexact 



Of these exceptions, there are four that are raised by an instruction before it can execute. They 
are progJiw.error, prog.prot, poison, and privileged. If one these exceptions is raised while it is 
masked, the stream will hang. If such an exception is raised and is unmasked, the stream will trap 
with the trap pc (in TO) pointing to t he instnictio n that caused the exception. If possible, the trap 
h^dler could apply some ^'antidote^and then try to execute the instruction again by returning to 

Two exceptions are not raised by the instruction at all: domain^ignal and instruction-count. 
Generally, the trap handler can service the event and return to the program address in TO to 
continue. When these exceptions are masked, they are simply ignored. 

Most exceptions are caused by the previous instruction. Since that instruction may have contained a 
jump, its address is lost. Exceptions in this class include create and all the floating-point exceptions. 

Presumably, a failing STREAM-CREATE could be handled by reserving a stream and then retrying 
the create. The easiest way to retry the create is to decrement TO and jump to it. In this case, 
there could be no jump in the previous operation since a create is a MAC-operation. A masked 
create exception will be ignored, although no stream will have been created. 

On a floating-point exception, the handler may want to examine the source registers, the operation, 
and the value written to the destination. Sometimes, it will want to place a new value in the 
destination based on the above information as well as on some global state. Masked floating-point 
exceptions are simply ignored. 

Finally, there are the data exceptions, which may be caused by any of the last eight instructions 
executed. These are data-hw.error, data-prot, data-alignment, and data-blocked. To handle these 
exceptions, the trap routine should check the data result codes and the corresponding DATA-OPA 
and DATA.OPD state. 

The exception register is shown below. There is one bit In the upper half of the exception register 
for each member of the Exception enumeration shown above. Each of these exception bits can 
cause a trap, but some ssw bit disables that trap; the left-hand column tells which one. 

The lower half of the exception register contains the poison flags. A speculative load operation that 
would otherwise raise an exception as described in §6.4 instead sets the poison flag corresponding 
to its destination register. These flags do not trigger traps directly, as confirmed by the "no trap" 
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annotation. Should an instruction use a poisoned register as a source operand, the poison exception 
will be raised and a trap may then occur. The poisoned register contains a data control descriptor 
(§6.3), permitting re-execution or diagnosis of the failed load operation by the exception handler. 



Trap Disable Bit Bits \Vd Field Name 



Type 



Description 



ExceptionRegister: Hardware Exceptions 

63-58 6 00 
hardware 57 1 datajiw.error 



hardware 



56 1 prog-hw.error 



Flag 
Flag 



reserved 

data memory error or network hard- 
ware error; see §6.2 

uncorrectable program memory error; 
see §7.1 



ExceptionRegister: System Exceptions 
55-53 3 0 
svstem 52 1 



system 51 1 



system 50 



inst ni cti on-count Flag 
data-prot Flag 



prog-prot 



system 49 1 poison 



nag 



Hag 



reserved 

instruction count became 0; see §10 
data protection level, man limit, seg- 
ment limit exceeded, unimplemented 
op, or unimplemented address; see 
§6.2 

program protection level, limit viola- 
tion, or unimplemented address; see 
§7.1 

use of a poisoned register; see §6.4 



ExceptionRegister: Signal Exceptions 

domain signal 48 1 domain-signal Flag domain signal: set when the stream 

level is less than the domain signal 
level; see §8.1 



ExceptionRegister: User Exceptions 
47-45 3 0 
user 44 1 create 



user 43 

user 42 
user 41 



privileged 



Hag 



Hag 



1 data^gnment Flag 
1 datajblocked Flag 



user 40 1 floatjextension Flag 
9.1 Exceptions 



reserved 

stream create exception: attempt to 
create more streams than are reserved; 
see the STREAM-CREATE operation 
unimplemented or privileged opera- 
tion; see §3 

unaligned data exception; see §6.1 
data memory retry exception, latency 
limit exception, or data trap 0 or 1; 
see §6.1 

float software extension; see §5.3 

Exception Register 
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EzceptionRegister: Floating-point Exceptions 





39-37 


3 


0 




reserved 


float invalid 


36 


1 


floatJnvalid 


Flag 


float invalid operation 


float zero divide 


35 


1 


floauero.divide 


Flag 


float zero divide 


float overflow 


34 


1 


float.overflow 


Flag 


float overflow 


float underflow 


33 


1 


float.underflow 


Flag 


float underflow 


float inexact 


32 


1 


float inexact 


Flag 


float inexact 



EzceptionRegister: Poison Flags 

(no trap) 31 1 pf31 Rag poison flagai 

(no trap) 1 1 pfl Flag poison flagj 

0 10 reserved 



The result code register contains a more detailed description of the results of memory and floating- 
point operations. The structure of the result code register is described here. 

Bits Wd Field N ame Type Description 

ResultCode 



ResultCode: A-unit Float Results 

63-56 8 0 reserved 

55-51 5 A-float-result-reg Reg previous A-unit result register 

50-48 3 A-float Jesuit-- Float Result Code A-unit result code 
code 



ResultCode: C-unit Float Results 

47-40 8 0 reserved 

3&-35 5 C-fioat-result-reg Reg previous C-unit resxilt register 

34-32 3 CJloat j-esult-- FloatResultCode C-unit result code 
code 



ResultCode: M-unit Data Results 

31-28 4 dr7 DataResultCode data resultr 

3-0 4 drO DataResultCode data resulto 



The FloatResultCode that is stored in field "A-float .result .code" or field "C-floatjresult.code" is 
described below. When no exception or only float Jnexact is raised, the result code is set to field 
"FR-FG^. When floatJnvalid, float^zero-divide, float^overflow, or float.underflow is raised, the 
result code is set to field "FR-PX". All other result codes are coupled with the float jextension 
exception. The result register field is xvTitten whether or not the result code is nonzero. 



FloatResultCode 
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Name 


Value 


Meaning 


FloatResultCode 






FR_FG 


0 


float good 


FRJM 


3 


operand to integer multiply is too 
large 


FRSX 


4 


float is exceptional 


FRJ)Z 


5 


divide by zero 


FR_DR 


6 


denormaiized operand to FLOAT_- 
RECIP_AJ>PROX 


FR_DQ 


7 


denormaiized operand to FLOAT_- 
RSQRT_A.PPROX 



Each of the four-bit data result fields in the low-order part of the result code register is written 
when an M-operation completes and contains one of the values shown below. 



Name 



Value 



Meaning 



DataResultCode 

DR-NONE 0 

DRJ)ATA.TRAPO 1 

DILDATA-.TRAP1 2 

DRJ)ATA.TRAP01 3 

DRJIETRY-LIMIT 4 

DRiATENCY-LIMIT 5 

DRJDATAJILIGNMENT 6 

DR.UNIMPLEMENTED-OP 7 



DR-MAPJ-IMIT 8 

DR.PROTECTION-LEVEL 9 

DR^EGMENTJ-IMIT 10 

DR-UNIMPLEMENTED.- 11 
ADDRESS 

DR.UNCORRECTABLE.- 12 
ERROR 



the operation completed successfully 
data trap 0 exception 
data trap 1 exception 

both data trap 0 and data trap 1 ex- 
ception 

data memory retr}' exception 
data memory latency exception 
data alignment exception 
unimplemented operation by DATA-- 
OPJIEDO, or aborted by dkill; see 
§8.4 

data map limit exception 
data protection level exception 
data segment limit exception 
data address unimplemented excep- 
tion 

uncorrectable data memory exception 



The data result fields in the exception register are normally read by the trap handler to diagnose 
failing memory operations. Data result code / corresponds to the data control descriptor and data 
retrieved by the DATA.0 PA-SAVE or DATA-OPD-SAVE operations with opno i. To simplify trap 
handling with one or zero failing memory operations, the most recent failing memory operation is 
relabeled as opno 0. 



9.2 Traps 
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9.2 Traps 

A trap exchanges the ssw.pc with the contents of target register TO and sets ssw.override mode in 

o a t; J7 nVt'" r '"^''^ corresponding trap disable bit is clear 

or a trap disable bit is cleared when a corresponding exception bit is set: see §9.1. \ stream that 
traps does not change its privilege. The trap is lightweight in the sense that onJv a"small amount 
of state need be saved before control is transferred to a user-supplied exception handler. While 

UHil mT't aI^ ''\fi "'P' ^'^^^^^^^ instruction counter 

IS disabled. In addition. aU but synchronizing loads, stores, and intjetch^dds will retry forever 
to prevent spurious retries from causing a nested exception. The trap handler should return to 
the main program us ng a LEVEL JITN with the appropriate level. This form of jump will dear 
ssw^vemde mode, allowing the next instruction to use the true ssw mode and trap mask bits. 
Operations that set TO are supervisor-privileged if field •^riv.tO" is set in the program state descrip- 
tor of the protection domain; this option allows auditing of security-relevant events by a trusted 
(but not necessarily privileged) trap handler. However, restoration of the trap handler entrv point 
when resuimng execution of the interrupted activity must be done at privUege level LEviuPER 
or higher if field "priv.tO" is set. 

Note that a stream can disable all traps, including system and hardware traps, although it mav be 
unwise to do so. DisabUng traps wiU not necessarily stall the processor or the stream. The operating 
system can easily regain control by raising field "min-pkiU" in the program state descriptor. If the 
stream s privUege level is less than field '^min.pkill", then the stream will branch to a fixed virtual 
address, generally containing a STREAM.QUIT. If a stream were to encounter a prog4>rot, prog.- 
hw.err, or pnvilege exception after branching in response to pkill. hardware diagnostic intervention 
is necessary to recover the stream. 

There are eight trap registers available for the trap handler to use as temporary storage as it saves 
or restores processor state. Other uses are discouraged. The trap registers are manipulated bv 
the TRAP^AVE and TR.\P JIESTORE operations. Due to hardware limitations, trap register 
sets are allocated and deaUocated from streams on demand. That is. the first TRAP-RESTORE a 
stream performs will allocate a trap register set. That set will be deallocated when the stream issues 
a TRAP^AVE of TRO. The current implementation provides 32 sets to serve the 128 streams. To 
protect the operating system, the last trap register set will not be allocated to a user level stream. 
If a stream tries to allocate a trap register set and fails, that issue is squashed. 
Every taken trap counts as a CNT-TRAP event. 
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Chapter 10: Resource Counters 

10.1 Instruction Counter 

Each stream has a 16- bit unsigned instruction counter that is intended for debugger support. 
When an instruction issues, the instruction counter is decremented if the field "countjdisable" in 
the ssw is not set and the counter is not already zero. If the instruction counter becomes zero, 
then an instruction count exception is raised. The instruction counter is set by the STREAM.- 
COUNTJNSTJIESTORE operation and is read by the STREAMXOUNT JNST operation. 

10.2 Protection Domain Counters 

Each protection domain maintains eight 64-bit resource counters. These counters are only updated 
every 256 cycles, which limits their resolution. 

instruction issue counter 

The instruction issue counter increments when an instruction issues in the domain. This 
counter is read by the COUNTJSSUES operation. 

memory reference counter 

The memory reference counter counts the number of memory LOAD, STORE, FETCH J^DD, 
or STATE operations that are issued in the domain. This counter does not count memory 
retries or additional memory fetches required for forwarding. When divided by instruction 
issues, this counter provides an indication of the average number of memorj' references per 
instruction. This counter is read by the COUNT-MEMREFS operation. 

stream counter 

The stream counter is incremented every 256 ticks by the contents of the protection domain's 
SRESx? counter. When multiplied by 256 and divided by cycles, this counter provides an 
indication of the average stream usage of the domain. This counter is read by the COUNT- 
STREAMS operation. 

concurrency counter 

The concurrency counter is incremented every 256 ticks by the number of memory operations 
in the protection domain that have issued but not yet completed. When multiplied by 256 
and divided by cycles, this counter provides an indication of the average number of memory 
operations in progress. This counter is read by the COUNT.CONCURRENCY operation. 

selectable event counters 

The four selectable event counters can be set to count any four of a sizable number of events. 
The events counted are selected by the event counter select register, which is set by the 
supervisor-privileged COUNT^ELECT^STORE operation and is read by the COUNT.- 
SELECT-SAVE operation. The event counter select register has the structure shown here: 
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Bits Wd Field N'ame 



EventSelect 
63-32 32 
31-24 S 
23-16 8 
15-8 8 
7-0 8 



00000000 

seLO 

seU 

sel-2 

sel.3 



CountSource 
CountSource 
CountSource 
CountSource 



The value of a selectable event counter is read 
The CountSource tag can be one of the values 
value has undefined results. Setting the tag 
results. 

Name Value 



reserved 

tag for event counter 0 
tag for event counter 1 
tag for event counter 2 
tag for event counter 3 

by the COUNT-EVENTS operation. 

shown below. Setting the tag to an undefined 

to denote a dedicated counter has undefined 

Meaning 



0 
1 
2 



CountSource: other operations 
CXT-M_\OP 

CNT_A._NOP 

CNT.C-NOP 

CovntSource: target registers 
CNT.TARGET 



CountSource: data memory 

CNTJ.OAD 4 

CNT_STORE 5 

cntjntj-etchjvdd 6 

cnt_\iem_retry 7 

CountSource: floating operations 

CNTJFLOAT-A.DD 8 

CNT-FLOAT_MUL 9 

CNTJLOATJDIV lo 

CNTJ'LOAT-SQRT u 

CNTJ-LOAT-TOTAL 12 
CountSource; branches 

cntjumpj:xpected 13 

cntjuxmp.unexpected 14 

ckt.transfer.total 15 



NOP operations executed by the M- 
unit 

NOP operations executed by the A- 
unit 

NOP operations executed by the C- 
unit 

TARGET set operations (not includ- 
ing TARGET^AVE) 

LOAD operations issued 

STORE operations issued 

INT J"ETCH_A.DD operations issued 

memory operations retried, including 

forwarding 

FLOAT_A.DD and FLOAT-SUB oper- 
ations 

FLOAT_ADD_MUL operations 
FLOAT_DIV operations 
FLOAT^QRT operations 
total floating-point operations 

expected JUMP or SKIP path taken 
unexpected JUMP or SKIP path 
taken 

sum of all transfer operations 



CountSource 
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CountSource: level switches 
C.XTJ.EVEL 



16 



operations 



CountSource: traps 
CNT.TRAP 



17 



traps taken 



CountSource: streams 



CNT.CREATE 



18 
19 



STREAM.CREATE operations 
STREAM-QUIT operations 



CNT.QUIT 



CountSource: dedicated counters 
CNTJSSUES 
CXT«MEMREFS 
CNT-STREAMS 
CNTXONCURRENCY 



128 
129 
130 
131 



10.3 Processor Counters 

Each processor maintains three 64-bit counters, 
clock 

The clock increments once every tick. It is initialized using the hardware scan mechanism 
during IP L. By convention, the clocks on all processors are synchronized: that is they agree 
in value. The clock is read by the CLOCK operation. 

phantom counter 

The phantom counter counts the number of instruction issue slots unused by its processor. 
Normally, an instruction will execute to completion once issued, so that true phantoms are the 
main source of unused slots. However, some exceptions, such as a poison exception, will cause 
the triggering instruction to be aborted, lasting'' an issue slot. These aboned instructions 
count 35 phantoms as well. 

ready counter 

The ready counter on each processor sums the total number of streams ready at each tick of 
its processor. However, a stream that issues as soon as it becomes ready will not contribute 
to this coimter. Due to stream scheduling constraints, a ready stream may wait a few cycles 
before it issues, even on a processor with free issue slots. 
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Chapter 11: Operation Descriptions 
11.1 Notation 

A TEEA MTA assembly language program has the same syntactic form as a sequence of Lisp 
expressions. Each instruction has the format: 

(INST lookahead M-operation A-operation C-operation) 

If any of the or C-operations is not specified, then the assembler will fill the missing 

operation with a NOP. Operations should be specified in order when one of them can be executed 
by more than one functional unit, e.g. FLOAT-ADD. 

An operation belongs to exactly one group. The operations in a group have minor differences, 
such as in the way opersinds are addressed, whether the M-, A-, or C-unit does the operation, and 
so forth. Each page in this section describes an operation group. The following page describes a 
sample operation group. 
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[OPERATION A operand'template) ... u r 01 02 CLASS 

64 42 3: :2 2: 21 0 

pseudo code description of operation-1 
{where variable 6 set) 

[OPERATION operand-template) ... u i- 01 03 CLASS 

64 42 22 2: 2: C- 

pseudo code description of operation-2 
{where variable G set} 

The operations in this group (here, OPERATION.l and 0PERATI0NJ2) are described in more 
detail in this section. 

The CLASS describes the set of M. A, and C instruction fields used by this operation. 

An identifier always refers to the same value within the description of an operation. The identifiers 
r. 5, t, u, V, u\ 2, y, r always refer to the contents of a register in a fixed position within the 64-bit 
instruction. Subscripts on identifiers are bit subscripts. Unless otherwise stated, the range of a 
subscript is from 63 down to 0. Bit numbers increase from right to left. An identifier denoting an 
immediate constant is quoted, e.g. 'disp. The range of an immediate constant is always constrained 
by a where clause. 

The clause "{where predicatey is a constraint The predicate is a Boolean function that must 
be true. The most common constraints bound the range of immediate constants. 

The pseudo code description uses a conventional Algol-like language containing flow statements, as- 
signment statements, and expressions, built from operators and operands. Operators are described 
below. An operand may be a constant (interpreted in base 10); a quoted identifier, such as 'disp, 
denoting an immediate \'alue: or a non-quoted identifier, such as r, denoting the contents of the 
register addressed by the value bound to that identifier. 

The assembly code prototype for an operation is: "(OPERATION operand-template)''. The en- 
coding for each operation is described using fields. The fields used by an operation are listed from 
left to right within the word, from bit 63 to bit 0. Each field has two parts. The field end is 
the field's low-order bit number. The field fill can be an identifier, which stands for its value; 
a literal constant, which is represented in hexadecimal; or an ellipses representing "holes" in the 
object encoding. For example, the encoding for OPERATION.1 denotes a field starting at bit 41 
and ending at bit 37 containing the register number u, a field starting at bit 36 and ending at bit 
32, containing the register number v, a field starting at bit 31 and ending at bit 27 containing the 
literal value "^Olie", and finally a field starting at bit 26 and ending at bit 21 containing the literal 
value "0216". 

For the most part, the order of fields in the object code encoding is the same as the order of fields 
in assembly language. Deviations from this rule are noted explicitly. In any event, the mapping 
from assembly to machine code is manifest in the encodings. 

EXAMPLES 

An example of how the instruction might be used is given here, 
RAISES 

The exceptions that the operation may raise are enumerated here. 
COUNTS AS 

The event counters that are incremented by this operation. 
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SEE ALSO 

The names of related operations and section numbers are given here 
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11.2 Operation Naming Conventions 

The mnemonics for operations are chosen according to these general rules. 

• The mnemonics for most operations start with the name of the data type being manipulated. 
The principle exceptions are the STORE family, the SHIFT family/the JUMP family, and 
the miscellaneous operations dealing with program stale. The data type prefixes are shown 
in Figure ll.L 

• Mnemonics ending with ".TEST'' generate a condition code. 

• Mnemonics containing JIESTORE'' move value(s) from general purpose registers into special 
registers. 

• Mnemonics containing ''.SAVE'' move value(s) from special registers into general purpose 
registers, 

• Mnemonics containing - JMM" contain a small immediate constant operand. Some of these 
operations accept one value in the assembly code, but place another value in the object code. 
For example, INT J\.DD JMM takes and adds a value from 1 to 32. but the value put in the 
object code ranges from 0 to 31; the hardware increments this value to produce the desired 
sum. 

• Mnemonics containing "-MAP" deal with the program or data map. 



data type what 



BIT a word of bits 

BIT-MAT an 8 * 8 matrix of bits packed into a word 

COUNT a resource count 

DATA M-unit state 

DOMAIN a protection domain 

EXCEPTION an exception 

FLOAT a floating-point number 

INT a signed integer 

LEVEL a privilege level 

LOGICAL a 64-bit wide logical value: O(false)., l(true) or -l(true). 

PTR a pointer to memor)% with access control 

STATE memory access state bits 

STREAM an instruction stream 

UNS an unsigned integer 



Figure 11. 1: Data Type Prefixes 
11.2 Operation Naming Conventions 
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• Mnemonics containing "^A-C use access control from the operation when computing an ad- 
dress. 

• Mnemonics containing "JNDEX" use scaled indexing when computing an address. 

• Mnemonics containing "JDISP" use scaled displacements when computing an address. 

11.3 Pseudo-code Operators 

Infix operators in expressions have the precedence and associativity customary to the C language. 
Parentheses are used in complex expressions to avoid confusion. The operator is lowest prece- 
dence and separates sequentially executed statements^ In addition, the operators shown in Fig- 
ure 11.2 are used. ° 



'Note that in ISP means paiallel execution of the statements. We adopt the conventional Algol semantics. 
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assignment 


store 


store to memory 


load 


load from memory 


V 


logical or 


A 


logical and 


G 


logical exclusive-or 




addition 




subtraction 




multiplication 


/ 


division 


min 


minimum value 


max 


maximum value 



square root 
tally the number of 1 bits 
€ is a member of 

>a shift right, with sign bit filling 
» shift right, with 0 filling 
< shift left, with 0 filling 

rotate right 

rotate left 

[t...j] range + l,...,j - l,i 

ai bit number i from a 

Or bits in the range r from a 



Figure 11.2: Pseudo-code Operators 
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(BIT^ND t u v) 

t ^ u Av 

(BIT^ND X y 2) 
2 — y A z 

(BIT-AND.TEST t u v) 

t ^ U Av 

(BIT-AND.TEST x y z) 
X *-y Az 

These operations compute bitwise and. 
BIT-\ND-TEST never generates overfiow/NaN or carry. 



... i u r 01 OC ... A 

6i 47 43 37 22 27 2i 0 
M 21 le^U 6 0 

... ^ li r 01 OD ... A 

64 47 42 37 32 27 21 0 

. . . r y z 13 C 

64 21 16^11 6 0 



RAISES 
(nothing) 



BIT-^ND- 
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(BITJMPruu) ... i ji t; 07 OC ... A 

64 *r 42 3T 32 27 21 0 

t — - u\/ V 

(BITJMP.TEST tuv] ... t u v 07 Od A 

64 42 3: 12 2r 21 0 

i u V r 

These operations compute bitwise implication- 

BIT JMP.TEST never generates overflow/NaN or carry. 



RAISES 
(notlung) 

Bit Operations ^ BITJMP. 
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(BIT-LEFT.ONES x y) 

... z w 04 00 C 

X - niin{i|(y<,) > 0} " " i« u 6 0 

(BIT_LEFT.ONES.TEST x yV 

, ••■ r y 04 01 C 
X - minO|(y<t) > 0) " i« n « o 

(BIT J.EFT JEROS x y) 

... X y 05 00 C 
X - 64 - min{«|(y»,) = 0} " • » 

(BIT_LEFT_ZEROS.TEST x v) 

' ••• X y 05 01 C 

X ^ 64 - min{«((y»,) = 0} " • » 

wlTinT''"" '"^^ °' """'^^'^^ °' "-•'"^ - °f 

J^rf^^lv'""" °' ^^"""^ ^^>- is 64 and never generate 



EXAMPLES 

usitTri^ f ^^^^^"'^'iv*^'' f ^ °f ^"^'^^ P°i°^«d by p could be done 

using the loop shown below. The code returns the bit offset from the leftmost bit of the vector 
and assumes that a zero will eventually be found. 



(LABEL loop) 



(INST 0 (LOAD n p) (TARGETJDISP tJoop loop) (REGJ^OVE bn rO)) 

(INST 7 (INT_^DD JMM p p 8) (BITJ,EFT.ONES.TEST b n)) 
(INST 0 (LOAD n p) 

(INT_A.DD bn bn b) 
(JUMP IF.C cO tJoop)) 



RAISES 
(nothing) 

SEE ALSO 
BITJUGHT. 



BITJLEFT. 
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(BIT-MASK / top bot) ... t 3 top hot 00 A 

for 2 e [0 ... 63] : t, - ( 'top > ;) e (i > 'bot) 6 ( top > 'bot) 
{where 'top e [0 . . . 6.3). 'bot € [0 . . . 63]} 

A mask is generated that contains 1-bits from bit positions [bot . . . top] and 0-bits elsewhere. If top 
is less than bot then the bit positions set to 1 are [0 . . . top] and [60/ . . . 63], generating a complement 
mask. 



EXAMPLES 

A mask containing ones in bit positions [21 . . .46] and zeros elservhere is generated by (BIT-MASK 
t 46 21). Its complement is generated by (BIT-MASK t 20 47). 

RAISES 

(nothing) 
SEE ALSO 

INTJMM 



Bit Operations 
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... t u V 10 OC ... A 



(BIT_MAT_OR t u v) 

for I J e [0 . . . 7] : t,.,^^ - VLo(t'8..>* A v^^k^j) 
(BIT_MAT.TRANSPOSE t u) ... < u IB 00 08 A 

for j,j 6 (0...7] :t8.,+; -U8.>+, 
(BIT_MAT JCOR t u v) 

for i,j € [0 ... 7] : ts..^; - SLoCt^s-.+it a v8.*+,) 

These operations provide the basic support for multiply and transpose of bit matrices. Each byte 
of a word represents a row of an 8 x 8 matrix. Matrices of arbitrary size can be represented as 
matrices of 8 x 8 blocks. 



... < u V 11 OC ... A 

«« i7 4J J7 31 2T 21 0 



EXAMPLES 

The word 8040201008040201i6 is the identity matrix. For either BIT_MATJCOR or BIT-MAT.OR 

the matrix 0102040810204080i6 in u will reverse the bytes in v., leaving the bit order unchanged. 

The same matrix in v will reverse the bits in each byte of u, leaving the byte order unchanged. 
RAISES 

(nothing) 
SEE ALSO 

BIT.OR, BIT.^ND, BITJCOR 



BIT_MAT. 
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(BIT-MERGE t u v w) ... t u v w 26 A 

64 4: 42 :? 32 2? :j 0 

for I € [0 . . . 63] : t, - if u-, then u, else iv 
(BIT-MERGE-TEST f u r u) ... t u v w 27 A 

C4 4: 42 3r 22 2: 31* * * 0 

for 2 € [0 . . . 63] : i, - if it-, then else r, 

These operations select a bit from u if the corresponding bit in w is set; otherwise the bit from v 
is selected. 

BIT3IERGE_TEST never generates overflow/NaN or carry. 



RAISES 
(nothing) 



Bit Operations 
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(BITJSTAND tuv) , 

' . . . f u w 06 OC A 

t i(U A V) 

(BIT_NAND.TEST tuv) , 

' . . . ( u V 06 OD A 

t 1(« A V) 

These operations compute bitwise negated and. 
BIT_\AND.TEST never generates overflow/NaN or carry. 



RAISES 
(nothing) 

BITJN'AND. 
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(BIT_NIMP t u v) 

t — a A -If 

(BIT J^IMP xy z) 
(BIT-NIMP.TEST / u v) 

t U /K-^V 

(BITJ^IMP.TEST X y z) 

X y A ^z 

These operations compute bitwise negated implication. 
BITJNTMP-TEST never generates overflow/NaN or carry. 



... ^ I- 00 DC ... A 
64 47 42 :r 22 :i o 

... I y no C 

... ^ u r 00 CD ... A 

64 47 42 32 27 21 0 

... X y -11 C 

64 21 16*'ll 6 0 



RAISES 
(nothing) 
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(BITJSrOR f u u) . 

, , 64 47 42 37 32 2T 21 0 

/ ^{u V V) 

(BIT_NOR.TEST / u V) . . . ^ « t, 04 OD . . . A 

. ^ 64 47 42 37 32 27 21 0 

t ^ «(u V v) 



These operations compute bitwise negated or. 
BIT-NOR-TEST never generates overflow/NaN or carr>'. 



RAISES 
(nothing) 



BIT-NOR- 
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(BIT_ODD_A.ND t u v) . . . t u v 09 OC 

t - (-l)-e"o 

{BIT-ODD_A.ND.TEST t u v) ... : u v 09 OD 

63 



(BIT_ODD-NIMP t u v) . . . f u r 08 OC 

63 



(BIT-ODD_NIMP-TEST t u v) . . . f u r 08 OD . . . 

^ 64 <: 42 37 32 27 21 0 

63 



(BIT.ODD.OR f u u) , . , t u v OB OC . . . 

64 4: 42 37 32 27 21 0 

63 



(BIT«ODD-OR-TEST t u v) . . . r u r OB OD . . . 

64 47 42 37 32 27 21 0 

63 



(BIT.ODD-XOR f li f) , . . r u r OA OC . . . 

64 47 42 37 32 27 21 0 

63 



(BIT_ODD-XOR.TEST fur) ... f t/ r OA OD ... A 

64 47 42 37 33 37 31 0 

These operations do a two-operand bitwise operation and compute the parity of the result. The 
value stored in t is either all I's or all O's. 



RAISES 
(nothing) 

Bit Operations ^ BIT-ODD- 
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(BIT.OR t u v) 

t ^ u\/ V 
(BIT.OR X y z) 
x--yy z 

(BIT.OR_TEST i u v) 

uVv 

(BIT.OR.TEST x y z) 

x^yy X 

These operations compute bitwise or. 
BIT.OR^TEST never generates overflow/NaN 



. . . u f 03 OC . . . A 

64 4? 42 37 32 2? 21 0 

... z y ^ 16 C 

64 21 16''ll 6 0 

... ^ u r 03 OD ... A 

64 47 42 37 32 27 21 0 

... X y 217 C 

64 21 16^11 6 0 

carry. 



R.\ISES 
(nothing) 

BIT.OR. 
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21 



for 7 € [0.. .63] : r, - r. r, a (tally {-^[uj ...uq]) = - T')} 

These operations are used for packing bit fields from r into r under control of the mask found in u. 

Bits from r selected by zeros in u are packed consecutively into bit positions in the destination 
register t. starting from the right. Unfilled positions in t are set to zero. 



EXAMPLES 

This example only shows 8 bits, and describes the operations for BITJPACK. Capital letters stand 
for arbitrary bit values. 

BITJACK register what 



11011001 u source selector mask 

RSTVWXYZ V source 
OOOOOTXY t destination 



Bits T, X and Y axe selected by the source mask, and are v/ritten into the destination register i in 
that order, packed to the right, with zeros filled in on the left. 

RAISES 

(nothing) 
SEE ALSO 

BIT.UNPACKJ 



Bit Operations 



BITJ>ACK. 
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(BIT JIIGHT.ONES X y) ... r y 06 00 C 

r — mjn{2|(y>2) is even} 

(BIT_ElIGHT.ONES.TEST z y) . . . r y 06 01 C 

X — mjn{2|(y>2) is even} 

(BITJUGHT JEROS X y) ... xy 07 00 C 

i_64-mjn{z|(y<i) = 0} 

(BIT JUGHT JEROS.TEST x y) . . . x y 07 01 C 

I ^ 64 - min{i|(y<:) = 0} 

These operations respectively return the number of consecutive 1- or 0-bits on the right end of the 
word in y. ° 

The TEST versions of these operations generate carry when the result is 64 and never generate 
overflow/NaN. 



RAISES 

(nothing) 
SEE ALSO 

BIT_LEFT. 

BITJUGHT. 
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(BIT-TALLY / ii) . . . / u 00 OF OC . . . A 

{BIT.TALLY.TEST r u) . . . ^ u 00 OF OD . . . A 

64 4: 42 jr jj j: 21 0 



These operations count the number of 1-bits in the u register vaJue. The .TEST versions never 
generate overflow/NaN and generate carry when f = 1. 



RAISES 
(nothing) 

Bit Operations BIT.TALLY. 
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(BIT.UNPACKJ t u v) 

for2€ (0...63){ 

shift, - ! -f. 1 _ tally (-.(1^, . . . uq]) 
s, — shift, A 63 
di — shifti A 15 

} 

(BIT_UNPACKJ5 t u v) 

for z 6 (0...63]{ 

shifti - t + 1 - taUy (-[u, . . . 
Si — shifti A 15 
di ^ shifti A 3 



. . . t u V 15 OC ... A 



... « u t; 16 OC ... A 

4 4T «J 3? 32 JT 2) 0 



} 



(BIT.UNPACK_3 r u i>) 

for 2€ [0...63]{ 

i/ii/t,- - i + 1 - taUy (^[u, . . . uo]) 



... « u u 17 OC ... A 

»4 47 42 37 32 27 21 0 



} 



Si ^ ^/ii/t, A 3 
di — shift, A 0 
ti-d, A v,_,, 



The BIT.UNPACKa BIT.UNPACKJ2, BIT.UNPACKJ operation sequence is used for unpacking 
bit data m v into result t under control of the mask found in u. 6 

RT'^TTMwr.^f]' "nr^^' of BIT.UNPACK.1 as the input data for BIT.UNPACKJJ (and 
BIT UNPACKJ2 for BIT.UNPACKJ), bits from v are packed consecutively into bit positions in 

'tt"'"^ '° ^'^^^^^"^^ P°=^^^°"^ i° ' ^« ^« »o ^"o- Extra 

bits from V are discarded. The operation packs from right to left. 

EXAMPLES 

'^L'^'^r' ^ operations for the three-operation bit unpack 

sequence. Capital letters stand for arbitrary bit values. 

bit unpack register what 



01001110 
RSTVWXYZ 
WOXYOOOZ 



u 
f 
t 



destination selector mask 

source 

destination 



by Jhe aJk^k if destination register t in that order, but in positions selected 

RAISES 

(nothing) 
SEE ALSO 

BITJ>.A.CK 



BIT.UNPACK. 
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(BITJCNOR f u r) . . . r u r 05 OC . . . A 

(BITJCNOR-TEST t u v) , . . r u r 05 OD . . . A 

^ 64 4? 42 z: 32 :: o 

These operations compute bitwise negated exclusive-or. 
BIT-XNOR-TEST never generates overflow/XaN or carry. 



RAISES 
(nothing) 



Bit Operations 
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(BITJCOR t u u) _ f u V 02 OC ... A 



6« 4: 42 37 32 2T 21 0 



(BIT JCOR r y .-) ... x y z 14 C 

I — y 9 ^ 



64 31 ]6 11 6 0 



(BIT_XOR_TEST t u f) ... r « r 02 OD A 

64 47 42 37 32 27 21* " 0 



(BIT JCOR.TEST x y z) . . . x y . 15 C 

64 21 16^12 6 0 

X ^ yQ 2 

These operations compute bitwise exclusive-or. 
BITJCOR-TEST never generates overflow/NaN or carry. 



RAISES 
(nothing) 



BIT JCOR- 
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(BREAK) ... 00 01 00 OO 00 ... A 

u *' i2 3? 32 2: o 

Raise privileged operaiion exception 

This operaiion raises a privileged operation exception. It may be used by the debugger to implement 
breakpoints. 



RAISES 
privileged 



Clock Operations 



BREAK 
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Pr'^cessor Principles of Operation 



(CLOCK ry) x y 19 00 C 

«« 31 16 11 6 0 

X — clock - y 

This operation returns the contents of the 64-bit clocic register, which increments by one on each 
clock tick. Normally, the clock register contents are synchronized across all processors in a system. 



RAISES 

(nothing) 
SEE ALSO 

§10.3 

CLOCK 
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(COUNT.CONCURRENCY r) . . . f 0 IB 13 08 . . . A 

64 4" a :r 3- 2: g 

t — (concurrency counter Iq 
(COUNTJEVENTS i CO . . . r 0 IB 5 ec 08 . . . A 

64 «r 4; 27 23 2: 2: 0 

/ — (event counter at eclp 
(COUNT JSSUES 0 ... r 0 IB 10 08 ... A 

^ 64 <: 42 2: 22 2: 21 0 

t — (instruction issue counter 
(COUNT-MEMREFS 0 ... r 0 IB 11 08 ... A 

64 4: 42 32 2: 21 0 

t — (memory reference counter)^ 
(COUNT-STREAMS 0 ... r 0 IB 12 08 ... A 

64 4: 42 3T 32 27 21 0 

t (stream counier)^ 

These operations read one of the eight counters in the protection domain D of the executing 
stream. The four event counters each can be set independently to one of the count sources by the 
supervisor-privileged COUNT-SELECT JIESTORE operation. Each event counter has an eight-bit 
CountSource tag that determines what is to be counted. These tags are packed in the count-select 
register. A description of the counters and the encoding of the CountSource tag is described in 
§10.2. 



RAISES 

(nothing) 
SEE ALSO 

COUNTJELECTJIESTORE, CLOCK, STREAM.COUNTJNST 



Data Memory Operations 
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P- -essor Principles of Operation 



(COUNT-SELECTJIESTORE u) ... 0 0 u 00 10 OA A 

64 47 44 42 27 Z2 27 21* ' ' 0 

(event counter selectj^^ — u 
(COUNT^ELECT_SAVE i.) . . . < 00 IB IC 08 A 

«« t- 42 Jr 3J J7 21* " 0 

i — (event counter selectjjj 

These operations allow the event counter select register to be read and written. The COUNT.- 
SELECTJIESTORE operation requires supervisor privilege. This register contains the four select 
tags for the programmable event counters, described in §10.2. 



RAISES 

privileged 
SEE ALSO 

COUNT. 

COUNT^ELECT. 



Operation Descriptions 



79 



(COUNT-PHANTOMS 0 

{ — (processor phantom counter) 



64 



4" i2 :: 3: 



r 0 IB 18 08 



e 



A 



(COUNTJIEADY /) 



64 



42 3- 22 2r 



/ 0 IB 19 08 



0 



A 



t — (processor ready counter) 

These operations read the processor's phantom and ready counters. The phantom counter sunas 
the number of ticks where no stream was ready, so that the processor utilization may be measured. 
The ready counter sums the number of ready but not issuing streams at each tick, so that the 
average ready pool size and average waiting time may be measured. 



RAISES 



(nothing) 
SEE ALSO 



CLOCK 



Data Memory Operations 



COUNTJPROC. 
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Pr'^ressor Principles of Operation 



(DATAJVIAP_FLUSH 5) 

flush (data map at 5) from data map cache 
(DATAJVIAP_FLUSH-A.NY s) 

flush any (data map at s) from data map cache 



6« 



0 s F 



0 5 F 



2> 



21 



00 00 03 02 



00 00 02 02 



MC 



MC 



These are supervisor-privileged operations to maintain consistency in the data address translation 



The domain in Bits 63-60 and segment in Bits 41-28 of s address the data map entrj'; other bits 
of s are ignored. A violation of the map limit will not raise a data map limit exception. The 
DATA-MAPJIUSH operation is used to flush a single map entry, as after changing the data map 
in data memory. The DAT-\_MAP_FLUSHjVNY operation is used to flush any map entry for the 
speciiied domain from the cache. Since the cache is not fully associative, up to 512 flushes' may be 
required— each flush should specify a different segment modulo 512. See §6.2. 

These operations are subject to the min-dkill level (see §8.4) based on the domain. Therefore, it 
may be necessary to perform a DATA_STAT£JtESTORE on the specified domain prior to flushing. 



cache. 




DATA-M.^P. 
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f DATA_0 PA-SAVE r apno) ... r 00 F . . . 00 00 0 opno 04 MC 

r — address state of operation opno 

(DATA.OPD^SAVE r opna) ... r 00 F ... 00 00 1 opno 04 MC 

64 cj 56 s: 47 :: 16 9 ^ c 0 

r — data state of operation opno 

(DATA-OP -REDO r s) ... r s F ... 00 00 10 04 MC 

64 61 £6 51 4r 2i 16 11 6 0 

perform memory operation in s with data to or from r 

These operations save and (rejexecute failed memory references. They are normally used by trap 
handlers. A stream may have eight memory references pending in the M-unit. Each reference is 
described by two words. One word contains the data value (if any), and the other contains address 
and control information. The data value must be read after the address word. The eight data 
result fields of the result code register describe which of these memory references are exceptional 
after a trap. 

DAL\-OP.A^AVE retrieves a Data Control Descriptor containing address and control information 
from the M-unit for the operation denoted by opno and places that information into register r (see 
§6.3). To allow an opno of zero to select the descriptor which must be saved first, the lookahead 
index is added to opno modulo eight. DATA-OPD^AVE retrieves the corresponding data from 
the M-unit for the operation denoted by opno and places that information into register r. Note 
that DATA-OPAJAVE must be performed before DATA.OPD-SAVE for each opno. 

The DATA-.OP-REDO operation re-executes an M-unit operation, given a Data Control Descriptor 
in register s and the corresponding data in register r. The ''original'' register number in field ''dest.- 
reg'' of 5 is ignored; a load operation will place its result in register r. DATA-OP JIEDO raises data 
memory exceptions (and otherwise behaves) just as the '^original" operation, as described by r and 
s, would have. If the value in 5 does not indicate one of the defined operations for the Data Control 
Descriptor, a data.prot exception is raised, and the result code is set to DR.UNIMPLEMENTED - 
OP. 

Since the value of s must be interpreted before it is known whether the DATA.OP -REDO operation 
will actually read or write r, poison is checked for reading r. 



RAISES 

data.prot, data-alignment, data-blocked 
SEE ALSO 

EXCEPTION., RESULTCODE., §6.3 



Data Memory Operations 



DATA.OP. 



82 



P" -issor Principles of Operation 



00 00 04 02 MC 

«4 «t it a <: 21 )6 11 6 0 



(DATA-STATE-RESTORE s) ... 0 s F 

(data state descriptor for domain in s) ~ s 

This supervisor-privileged operation is used to set the data state descriptor. 

The value 5 is the data state descriptor: see §8.4. The Bits 63-60 of 5 specify the protection domain, 
i.e. the data state descriptor is self-tagged. 



RAISES 

privileged 
SEE ALSO 

PROGRA.NLSTATE. 

DATA-STATE. 
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(DOMAIN -ENTER) . . . 0 0 00 00 OD OA . . . A 

if limbo > 0 then 

limbo — limbo - 1: 
SRESd — SRESd -f 1: 
5CUR£) — SCl"R£) -i- 1: 

else 

raise create exception 

end 

(DOMAIN-LEAVE u) . . . 0 0 w 00 OC OA . . . A 

^ 64 4T 44 42 37 32 2? 21 0 

limbo — /im6o+ 1; 
SCVKd — SCURp - 1; 
SRES£) — sresd - 1; 
D — u; 

if /imfeo >= 128 then 
raise create exception 

end 

These are a supervisor-privileged operations to change protection domains. No lookahead is allowed 
across a DOMALN JLEAVE/DOMAIN-ENTER pair. 

Executing a DOMAIN -LEAVE, DOMAIN JENTER sequence will change the domain to the specified 
value. The protection domain D of the stream changes to that specified in Bits 3-0 of register u. 
The current pc address in the ssw must map to the ssime page in both the new and old domains. 

The create exception protects against a DOMAIN JiNTER without a matching DOMAIN -LEAVE, 
or too many domain changes in progress at once. 



RAISES 

privileged, create 
SEE ALSO 

LEVELJITN 

Domain Operations DOMAIN- 
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P' '^ssor Principles of Operation 



(DOMAIN JDENTIFIER^AVE 0 . . . f 08 00 00 00 . . . A 

64 47 42 27 32 2T 21 0 

t — D. the current protection domain identifier 

This is a supervisor-privileged operation used to retrieve the identity of the current protection 
domain D. 



RAISES 
privileged 

DOMAIN JD. 
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(EXCEPTION-RESTORE ui 



0 0 00 09 OA 

■ 44 42 3: 22 27 



0 



A 



EXCEPTION — U 



(EXCEPTION^AVE r) 



64 



21 16 n 6 



X 00 IC 00 



0 



c 



2 — EXCEPTION 

These operations manipulate the exception register, which contains the exception bits that record 
unusual and possibly significant events resulting from instruction execution. Every exception bit 
causes a trap unless that trap is disabled by the appropriate bit in the trap mask of the ssw. 

Additional information about the most recent floating-point and memory exceptions are found in 
the result code register. 



RAISES 



(nothing) 
SEE ALSO 

RESULTCODE^ §9.1 



Float Operations 



EXCEPTION- 
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(FLOAT-CEIL r u) . . . r u 18 03 08 . . . A 

64 47 42 3~ 32 27 21 0 

t — (float ceiling of float u) * 2"^°'^ 
(FLOAT_CHOP r ti) ... r li 18 01 08 ... A 

64 47 42 3T 32 37 21 0 

t — (float integer chop of float u) * 2~^°"^ 
(FLOAT-FLOOR t u) ... t u 18 02 08 ... A 

64 47 42 37 32 27 21 0 

t ^ (float floor of float u) * 2"^°^^ 
(FLO AT JSTEAR t u) ... i u 18 00 08 ... A 

64 47 42 37 32 37 22 0 

/ — (float integer nearest float u) * 2"^°^"* 
(FLO AT JIOUND t u) . . . f u 18 OC 08 . . . A 

64 47 43 37 32 27 21 0 

t — (float integer round of float u) ♦ 2"^°^^ 

These operations scale 64-bit floating-point numbers by 2'^^^^, then round the result into floating- 
point numbers. Since the scaling reduces 1.0 to the minimum denormalized number, the effect 
is to round the argument to an integer. The roundings are directed as in IEEE Standard 754. 
FLOATJIOUND uses the rounding mode in the ssw. 

Note that the float Jnexact exception is never raised, since it is expected for these operations. 



RAISES 
(nothing) 

SEE ALSO 
INT^ UNS. 



FLOAT. 
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(FLOAT-ADD rut) . . . f u r 10 OE . . . A 

t — u + V. floating point 

(FLOAT-A.DD X y z) ... x y rOC C 

6« 31 16 i: 6 0 

I — y + X, floating point 
Tliis operation computes the floating-point sum of two numbers. 



RAISES 

float jnvalid, float-overflow, float inexact 
COUNTS AS 

CNT_FLOAT-ADD, CNT.FLOAT.TOTAL 
SEE ALSO 

FLOAT JIDDJ^UL 



Float Operations 



FLOAT-ADD- 
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Prr -ossor Principles of Operation 



(FLOAT^DD-MUL t u v w) t u v w 30 A 

64 47 42 3" 32 2T 21 0 

i — u ''r V * ii\ floating point 

This operation does a floating-point multiply followed by a floating-point add, counting as two 
floating-point operations. If u is register 0. only a floating-point multiply is performed (to preserve 
the sign of a zero result). 

Onlv one rounding operation is performed, enhancing accuracy and greatly facilitating doubled 
precision operations. 



RAISES 

floatinvalid, float-overflow, floatJnexact, float-underflow 
COUNTS AS 

CNT JLOATJvIUL, CXT_FLOAT_A.DD if u ^ 0, CNT-FLCAT.TOTAL 
SEE ALSO 

FLOAT_\DD, FLOAT.SUB-MUL, FLOAT_SUB-MUL_REV. INT_\DDJ^IUL, §12.4 
FL0AT-\DD31UL. 
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{FL0AT_.4PPR0XJlESTORE y) ... 00 y 01 00 C 

index — y5i...45 
for ? € [0...7] : 

rbase, — y2.i 

qbase, — y2.,-ri 
for 2 e [8...15j : 

rbase,- — ^2-1+2 

qbase,- — yj-t+a 
rbaseie — yae 
qbase^e — ^37 
for : € [0...2] : 

rslope,- — y2..+38 

qslope,- — y2...r39 
for2 € [3. ..6] : 

rslope, — y2-»>46 

qslope- — ^2.1+47 
recip base-^^^^ — rbase 
rsqrt base,^^^^ — qbase 
recip slope,'^^ex ^ rslope 
rsqrt slope,-^^^, — qslope 

{where yie = par2ty(rbas€o.,7), 3/17 = pa'"^^y(qbaseo..7), 
^34 = Pari<y(rbase8..i5),y35 = parzty(qbase8,j5), 
j/60 = part<y(rbasei6|rslope),y6i = parity(qbasei6|qslope)} 

This IPL-privileged operation is used to initialize the floating-point reciprocal and reciprocal square 
root approximation tables. - 



RAISES 

(nothing) 
SEE ALSO 

FLOATJIECIP JIPPROX, FLOAT JISQRTJIPPROX 

noat Operations , FLOATJ^PPROX-RESTORE. 
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P'-'- "ssor Principles of Operation 



(FLO AT-CMP .TEST tut) ... t u v 11 OF A 

t — u - V. floating point 

(FLO ATXMP.TEST x y z) ... x y :0? C 

X — y - s, floating point 

This operation computes floating-point comparison. 

FLOAT.CMP_TEST generates overflow/NaN if either operand is NaN, and raises floatinvalid. 
The condition code is zero if the two operands are equal, negative if the first (t< V y) is less than 
the second (r V 2), and positive if the first is greater than the second. Carry is set if and only if 
V V r is NaN. 



RAISES 

floaiJnvalid 
COUNTS AS 

CNT_FLOAT_A.DD, CNT-FLOAT.TOTAL 
SEE ALSO 

FLOATJ.IAX.TEST. FLOAT_MIN_TEST, §5.1 



FLOAT.CMP. 
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(FLOATJDIV r u r u;) t u v w IZ ... A 

64 4r 42 2' 22 27 J: 0 

t — u -f ij * ti\ floating point 

This operation is used to complete the floating-point division of u by r. The reciprocal in xv is a 
SpecjalFloai64 as delivered by FLOATJTER. The inexact exception arising from the division of 
two floating-point numbers will be raised by this operation. 



RAISES 

floatjoverfiow, fioat.underflow, float unexact 

COUNTS AS 

CNT-FLOATJ)IV, CNT-FLOAT.TOTAL 
SEE ALSO 

FLOATJ)IVJU>PROX, §12.5 



Float Operations 



FLOATJ)IV 
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Prr'^ssor Principles of Operation 



(FLOAT_DIV_APPROX t u v w) ... t u v w 3k ... A 

64 47 43 37 32 27 21 0 

exp — unbiased exponent of u: 

t — V* u;/2*^~'*^, floating point, round to nearest 

(FLOAT^QRT^PPROX_TEST r 11 t; ti;) ... / li u u' 3B ... A 

64 47 42 37 32 27 21 0 

exp — unbiased exponent of u; 

t — r » w/2^^^^^^i, floating point, round to nearest 

These operations perform a floating-point multiply of SpeciaIFloat64 w with Float64 u, with round 
to nearest. They are used for floating-point division and square root computation. A floatJnv-alid 
exception is raised for 0/0. infinity/infinity, and square root of negative. A floatJnexact exception 
is raised only in conjunction with floatjoverflow. The condition code produced bv FLOAT-SQRT.- 
APPROX.TEST is undefined. 



RAISES 

floatinvalid, float-zero.divide, float.overflow, float inexact 
COUNTS AS 

CNTJLOAT-TOTAL 
SEE ALSO 

FLOATJ)R\ FLOAT^QRT, FLOATJTER, §12.5 



FLOATJ)I\^JVPPROX 
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(FLOATJDIVJERROR t u v w) ... t u v w 3C A 

64 *r i7 :t 32 27 21 0 

exp — unbiased exponent of r: 

/ — (li - i; * u;)/2"^"^^, floaiing point, round to nearest 
(FL0AT^QRTJE:RR0R.TEST tuvw) . . . / 1/ r u; 3D . . . A 

64 47 42 :r Z7 27 21 0 

exp — unbiased exponent of u: 

t — 0.5 * (i/ - r * ti;)/2l"P/2J. floating point, round to nearest 

These operations perform a floating-point subtract of u with the product of v and u\ using round 
to nearest. The result is scaled to avoid undesirable overflow or underflow. They are used for 
floating-point division and square root computation. 

FLOAT-DIV J;RR0R returns a zero with the sign of u when u is infinity or NaN, or r is infinity, 
NaN, or zero, or xv is infiDity or NaN. When the rounded result would be zero and inexact, it 
is rounded away from zero to produce the minimum denorm floating-point number. FLOAT-- 
SQRTJERROR-TEST returns a positive zero when v is infinity, NaN, or zero. The condition code 
produced is undefined. 



RAISES 

(nothing) 
COUNTS AS 

CNTJLOAT.TOTAL 
SEE ALSO 

FLOAT_DIV, FLOAT.SQRT, FLOATJTER, §12.5 



Float Operations 
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P- '^ssor Principles of Operation 



(FLOATJNT t 1/) / u 18 IE 08 .. A 

64 47 43 3r 33 37 31 0 

t — float of integer u 

This operation converts an integer into a floating-point number, rounding according to the current 
rounding mode in the ssw. 



RAISES 

float Jnexact 
SEE ALSO 

FLOAT.VNS 

FLOATJNT, 
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(FLOATJTER f 1/ r u;) ... t u v w 38 A 

64 i: 42 32 2T 2J 0 

/ — u + V * u;, floating point, round to nearest 

In use. u and w are an extended precision (SpecialFloat64) reciprocal whose accuracy is increased 
by cancelling out the relative error given by the floating-point number r. The result is stored as a 
SpecialFloat64. FLOATJTER is used in both floating-point and integer division and square root 
computations. 



RAISES ; 

(nothing) 
SEE ALSO 

FLOATJIECIPJ^PPROX, FLOATJ)IVjlPPROX, §12.5 



Float Operations 



FLOATJTER. 
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° -cesser Principles of Operation 



(FLOAT_MAX t u v) ... t u v 13 OE .. A 

64 47 42 37 33 27 21 0 

t — inax(u, v), floating point 

(FLOAT^AX.TEST t u v) ... ^ u r 13 OF A 

t — max(u, r), floating point 

These operations select the larger of the two floating-point operands. If both operands are NaN, u 
is selected; if only one is NaN, the other (non-NaN) operand is selected. If both operands are zero, 
a positive zero is selected if one is present. See §5.1. 

FLOATjMAX.TEST generates overflow/NaN if either operand is NaN. The condition code is zero 
if the two operands are equal, negative if the first (u) is less than the second (v), and positive if 
the first is greater than the second. Carry is set if and only if v is NaN. 



RAISES 

(nothing) 
COUNTS AS 

CNTJLOAT-TOTAL 
SEE ALSO 

FLOAT_MIN, SELECT-FLOAT, LNT_MAX 



FL0AT31AX_ 
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(FLOATJVIIN t u v) 

t — miii{u,v), floating point 



64 



47 4: 3r 32 2: 



^ li r 12 OE 



21 



0 



A 



(FLOATJVIIN.TEST t u v) 



64 



47 42 37 32 27 



i U V 12 OF 



0 



A 



t — min(iz,r). floating point 

These operations select the smaller of the two floating-point operands. If both operands are NaN, 
u is selected; if only one is NaN, the other (non-NaN) operand is selected. See §5.1. 

FLOATJvIIN.TEST generates overflow/NaN if either operand is NaN. The condition code is zero 
if the two operands are equal, negative if the first (u) is less than the second (v), and positive if 
the first is greater than the second. Carry is set if and only if t; is NaN. K both operands are zero, 
a negative zero is selected if one is present. 



RAISES 

(nothing) 
COUNTS AS 

CNTJLOAT-TOTAL 
SEE ALSO 

FLOAT.MAX, SELECT-FLOAT, INTJvflN 

noat Operations , FLOATJ^IN. 
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P- '^ssor Principles of Operation 



(FLOATJMMAX t u v) ... t u v IS 01 ... A 

64 47 43 37 33 37 31 0 

t — if abs(u) > abs(v) then u else v end, floating point 
(FLOATJVlMAX.TEST t u v) ... f u l- 15 OF A 

64 47 43 37 33 37 3l" ' "o 

i — if abs{u) > abs{v) then u else v end, floating point 

These operations select the larger in magnitude of the two floating-point operands. If both operands 
are NaN, u is selected; if only one is NaN, the other (non-NaN) operand is selected. If the operands 
have equal magnitude, u is selected. 

FLOAT-MMAX.TEST generates overflow/NaN if either operand is NaN. The condition code is 
zero if the two operands are equal in magnitude, negative if the first (u) is smaller than the second 
(t;), and positive if the first is larger than the second. Carry is set if and only if v is NaN. 



RAISES 

(nothing) 
COUNTS AS 

CNT-FLOAT.TOTAL 
SEE ALSO 

FLOAT^MMIN, SELECT JLOAT, FLOATJ^AX 
FLOAT-MMAX. 
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(FLOAT_MMIN / u v) 

t — if ab$(u) < abs{v) then t/ else t- end, floating point 



t u V li OE 



A 



64 



0 



(FLOAT_MMIN_TEST / u v) 



t u V 14 OF 



A 



64 



47 42 37 32 27 



0 



t r- if abs{u) < abs{v) then u else r end, floating point 

These operations select the smaller in magnitude of the two floating-point operands. If both 
operands are NaN, v is selected: if only one is NaN, the other (non-NaN) operand is selected. If 
the operands have equal magnitude, v is selected. 

FLOATJ^IMIN.TEST generates overflow/NaN if either operand is NaN. The condition code is zero 
if the two operands are equal in magnitude, negative if the iirst (u) is smaller than the second (v), 
and positive if the first is larger than the second. Carry is set if and only if v is NaN. 



RAISES 

(nothing) 
COUNTS AS 

CNTJLOAT.TOTAL 
SEE ALSO 

FLOATJvIMAX, SELECT-FLOAT, FLOATJ^ 

noat Operations , FLOATJvIMIN. 
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-essor Principles of Operation 



... t u V w 3A A 

64 47 42 3T 32 27 21 0 

This Operation performs a floating-point multiply foDowed by a floating-point subtraction. It differs 
from FL0AT-5UB-MULJIEV in the response to exceptions. In particular, when u. r, or w is 
infinity or NaN, the result is zero with no exception. Normally, an infinity or NaX would be 
returned and an exception possibly raised. In particular, if t? ^ ti; rounds to infinity and u is that 
infinity, 0 rather than NaN is generated in t and no exception is raised. 

Only one rounding operation is performed, enhancing accuracy and greatly facilitating doubled 
precision operations. 



(FLOATJMULJLOWER t u v w) 
t ^ V * w - u, floating point 



RAISES 

float,underflow. float-overflow, floatJnexact 
COUNTS AS 

CNT JLOATJvIUL, CNT-FLOAT-\DD if # 0, CNT-FLOAT.TOTAL 
SEE ALSO 
§12.4 



FLOAT-MULJ-OWER- 
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Pr'^r.essor Principles of Operation 



r y OD 00 
21 u^n 6 0 



(FLO AT JlECIP^PPROX z y) . . . x v OC 00 

exp — unbiased exponent of y; 
X — an approximation to 2^^*^^^: floating point 

(FLOATJIECIP^PPROX^TEST I y) ... x v OC 01 

exp — unbiased exponent of y; 
z ^ an approximation to 2"'^^5Vy» floating point 

(FLOATJEISQRT^PPROX x y) 

6 

exp ^ unbiased exponent of y; 
X ^ an approximation to 2^''^f^i/y/y^ floating point 

(FLOATJlSQRT^PPROX.TEST x y) . r v CD 01 C 

•4 Jl It'll t 0 

exp «- unbiased exponent of y; 

I an approximation to 2^*'f/^i/^, floating point 

These operations are used for computing floating-point reciprocals and reciprocal square roots. 
They perform a table lookup operation foDowed by a linear interpolation using an adder-multiplier. 
The table is in an internal format. The approximation is returned as a SpecialFloat64. 

In FL0AT.*_A.PPROX, if the y operand is denormaiized, the float.extension exception is raised 
and y is returned. When the y operand is denormaiized with FLOAT.* _A.PPROX_TEST, they 
return y, set carry, and raise no exception. In either case, if the y operand is zero, y is returned. 



RAISES 

float.extension 
SEE ALSO 

§12.5 



FLOAT_APPROX_ 
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(FLOATJLECIP J:RR0R t L- u) . , . t IE v w 00 . , , A 

64 47 42 Z7 32 2: 21 0 

exp — unbiased exponent of r; 

r — 1.0 - v * floating point, round to nearest 

(FLOATJISQRTJERROR.TEST ^ u r u) ... / u t u 3F ... A 

64 4: 42 37 32 27 21 0 

exp — unbiased exponent of u: 

f — 0.5 * (1.0 - r * tL72L"^/^J), floating point, round to nearest 

These operations are used for computing floating-point reciprocals and square roots. They perforni 
a partial Xemon's method iteration using the adder-multiplier, returning the relative error in the 
SpeciaIFloat64 reciprocal or reciprocal square root in w compared to the Float64 divisor or square 
root estimate in v. The condition code produced by FLOATJISQRTJERROILTEST is undefined. 



RAISES 

(nothing) 
COUNTS AS 

CNTJ^LOATJOTAL 
SEE .USO 

INT-RECIP-ERROR, §12.5 



Float Operations 



FLOAT.ERROR. 
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(FLOAT^CALB t v w) t IS v w 00 A 

«4 4T 42 JT J3 JT 21 0 

t — v*'}""', floating point 

This operaiion is used to multiply the floating-point number v by a power of two selected by the 
signed integer in the low 13 bits of u;. . 



RAISES 

float jjverflow, float_underflow, floatinexact 
COUNTS AS 

CNTJPLOAT.TOTAL 
SEE ALSO 
INTJLOGB 

FLOAT-SCALB 
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{TLOAT^QRT t u V w) t u v w IT ... A 

64 4T 42 2T 32 21 0 

/ — u V * w. floating point 

This operation is used to complete floating-point square root of the floating-point number in u, 
using the SpecialFloat64 reciprocal in u*. The inexact exception arising from computing the square 
root of a floating-point number is raised by this operation. 



RAISES 

floatinexact 
COUNTS AS 

CNTJ^LOAT^QRT, CNT-FLOAT-TOTAL 
SEE ALSO 

FLOATJISQRTJ^PPROX, §12.5 



Float Operations 



FL0ATJ5QRT 
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P^- cesser Principles of Operation 



(FLOAT^UB i u v) 

t ^ u - i\ floating point 
(FLOATJSUB xyz) 

X — y - floating point 
This operation computes floating-point subtraction. 



... f u r 11 OE ... A 

64 47 42 3T 32 27 23 0 



...xyzOE C 

64 22 26^12 6 0 



RAISES 

float JnvaJid. float.overflow, floatinexact 
COUNTS AS 

CNTJFLOAT.\DD, CNT JLOAT.TOTAL 
SEE ALSO 

FLOAT^UB_MUL, FLOAT^UB-MULJIEV 



FLOAT^UB- 



Operation Descriptions 



107 



(FLOAT_SUB_MUL t u v u-) ... t u v v: 32 ... A 

64 47 42 ;t j: ji c 

t — ti — V * xv. floating point 
(FLOAT^UB-MULJEIEV t u v w) . .. t u v u- 36 . . . A 

«4 4: «J 37 ii 2: 2J 0 

t — V » w - u, floating point 

These operations perform a floating-point multiply followed by a floating-point subtraction. If « is 
register 0. only a floating-point multiply is performed (to preserve the sign of a zero resuJt). 



RAISES 

floatinvalid, float-overflow, float inexact, floatjunderflow 
COUNTS AS 

CNT JLOATJvIUL, CNT J"LOAT_'^DD if « # 0, CNT_FLOAT.TOTAL 
SEE ALSO 

FLOAT_ADDJ^UL, FLOAT^UB 



Float Operations 



FLOAT-SUB JvlUL. 
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Fm-^ssoT Principles of Operation 



(FLOAT.UNS / «) ... , , ,8 IF 08 ... A 

«« 47 42 ;r J3 37 Jl 0 

t — float of unsigned integer u 

This operation converts unsigned integers into floating-point numbers, rounding according to the 
current rounding mode in the ssw. 



RAISES 

floatJnexact 
SEE ALSO 

FLOATJNT 

FLOAT.UNS. 
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(INT.CEIL r u) ... r u 18 07 08 ... A 

64 i: 42 3r 22 2: 21 0 

t — ceiling of float u 

(INT-CEIL.TEST ? li) . . . ? u 18 07 09 . . . A 

64 4: 42 Zr Z2 27 21 0 

/ — ceiling of float u 

(INTXHOPfu) ... f « 18 05 08 ... A 

64 47 42 37 32 27 2: 0 

t — integer chop of float u 
(INT.CHOP.TEST ^ u) ... f li 18 05 09 ... A 

64 47 42 37 32 27 21 0 

t integer chop of float u 
(INT_FLOOR i li) ... t u IB 06 08 ... A 

64 47 42 37 33 27 21 0 

t — floor of float u 

(INT JFLOOR^TEST i li) ... / u 18 06 09 ... A 

64 4 7 42 37 32 2T 2: 0 

t — floor of float ii 

(INTJVEARiu) ... f V 18 04 08 ... A 

64 47 42 37 37 27 2) 0 

t ^ integer nearest float u 
(INT_NEAR.TEST i u) ... f u 18 04 09 ... A 

64 47 42 37 32 27 21 0 

t *- integer nearest float u 
(INT JIOUND i li) ... f „ 18 OD 08 ... A 

64 47 42 37 32 27 , 22 0 

t ^ integer round of float u 
(INTJIOUND.TEST t u) ... f u 18 OD 09 ... A 

64 47 42 37 32 27 21 0 

i integer round of float u 

These operations convert floats into signed integers. The roundings are directed as in IEEE Stan- 
dard 754. INTJIOUND uses lae rounding mooe in the ssw. 

A floatinvalid exception is raised when the result is not a representable signed integer. In these 
cases the result is reduced modulo 2^. 

The -TEST versions of these operations never generate carry or overfiow/NaN. 



RAISES 

float invalid, floatJnexact 
SEE ALSO 

FLOATJNT, FLOAT-, UNS. 



Integer Operations 



INT. 



no ^ • P-w, 



cessor Principles of Operation 



(INTJiDBtuv) ... , „ OE .. 

64 47 42 37 32 27 21 0 

f — ti + integer 



(INTJUDDxyz) ...ryz24 C 

64 21 16^11 6 0 

X ^ y + 2, integer 



(INT^D.TEST i « «) ... r u r, IC OF 

64 47 42 37 32 27 21* " 0 

t u + V, integer 



(INT_ADD.TEST r y .) ... r y z2S C 

X y + z, integer 

These operations perform two's-complement and unsigned integer addition. 

The resulting condition code from the .TEST version of this operation has its two's-complement 
definition. 



RAISES 

(nothing) 
SEE ALSO 

INTJIDDJMM 

INTj^DD. 
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riNT-ADD JMM f u uc/ue) ... t u 0 bvalue 20 ... A 

f — u + Talue. integer 

{where 'value € (1 . - • 512]. 'bvalue = 'value - 1} 

fINT-ADD-IMM J y i?fl/ue) ... x y bvalue OA C 

^ Cf / 64 21 16^11 6 0 

X — y + 'value, integer 

{where 'value € [1 ... 32], 'bvalue = 'value - 1} 

(INT-ADD JMM.TEST ^ u uaiue) ... t u 0 bvalue 21 ... A 

^ ^ 6i 47 42 37 36 27 21 0 

t — u + 'value, integer 

{where 'value € [1 . . . 512], 'bvalue = 'value — 1} 

(INT-ADD JMM.TEST r y ua/ue) ... x y bvalue OS C 

^ / 64 21 16^11 6 0 

X — y + 'value, integer 

{where 'value € [1 ... 32], 'bvalue = 'value — 1} 

These operations effectively add a constant between 1 and 32(512) to y{u), storing it in x{t). 

The resulting condition code from the -TEST version of this operation has its two's-compiement 
definition. 



RAISES 

(nothing) 
SEE ALSO 

INT-\DD, INT^UB JMM 

Integer Operations . INT J^DD JMM. 
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:essor Principles of Operation 



64 47 43 37 33 37 3] 0 



... t U V W 29 , 
64 47 42 37 32 27 21 0 



(INT^DJVIUL tuvw) .. ./..«..v.u;..28 

t — u -i- V* w, integer 

(INT_ADD_MUL-TEST t u v w) 
t — u T r * tt', integer 

These operations perform two's-complement multiplication and addition. A multiply is accom- 
pushed by letting u be register 0. 

The .TEST versions of these operations never generate carry or overflow/NaN. despite the fact 
that the multiply or the add mdght overflow. 

If r or is outside [-2» ... 2^3 - i], the float.extension exception is raised and the result in t may 
be mcorrect. 



RAISES 

float .extension 
SEE .A.LSO 

UNS-\DD.MUL.UPPER, INT_^DD, INT^UBJiIUL, INT^UB J^UL-REV 



INT_^DDJV1UL. 
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(INTJDIV.CHOP f u r U-) ... i v i- IB . A 

exp — unbiased exponent of w 
temp — V * round to zero 

i — iemp * 2"^. round to zero 

(INT J3IV.CH0P.TEST t u v w) ... t v v is A 

64 4T 42 jr 32 27 2J* * * 0 

exp — unbiased exponent of w 
temp — r * round to zero 

i — temp * 2^^^, round to zero 

(INT-DIV_FLOOR t u v iv) ... < u r u; lA A 

64 47 42 37 32 27 22* ' * 0 

exp — unbiased exponent of w 
temp — v * w/2^^^, round to zero 
t ^ temp * 2*'P, round to floor 

(INT_DIVJLOOR-TEST t u v w) t u v w IB A' 

64 47 42 37 32 27 21* * ' 0 

exp — unbiased exponent of u; 
temp — V * round to zero 

t — temp * 2*^, round to floor 

These operations are the last step in integer division. The product of the integer v and Spe- 
cialFloat64 w is shifted right according to the exponent of w and rounded, producing an integer. 

The -TEST versions of these operations generate cany when the quotient is not exact, i.e. when 
the division by 2""*^ yields a non-zero remainder. 

If r is outside [-2^^ ... 2^ - 1], the float.extension exception is raised and the result in i mav be 
incorrect. 

-AJthough register u is not used in the current hardware implementation, the software requires u to 
contain the denominator in order to properly handle float .extension exceptions. 



RAISES 

float.extension 
SEE ALSO 

UNS J)IV, §12.6 

Integer Operations INT JDIV. 
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... r 5 F ... ac y 10 39 MC 

64 61 56 51 4T 21 16*11 6 0 



(INT_FETCH^DD_4C JDISP r s ac disp) ... rsC ... ac sdisp 11 MC 

temp — (word at 5 -f- 'disp mod 2^^}; 

(word at s + 'disp mod 2*^) - r + (word at s -i- 'disp mod 2^«), with 'ac: 
r — icmp 

{where 'disp € (0 . . . 16383], '5</w;> = 'disp/8} 

(INT JTETCH-ADD^CJNDEX r s ac y) 
temp — (word at s + 8 * y mod 2^); 

(word at 3 + 8 * y mod 2''8) - r + (word &ts + 8*ymod 2^«), with 'ac; 
r — temp 

{INT_FETCH^DD_DISP r s disp) ... rsC ... sdisp 10 MC 

«4 (1 M SI 4T ai ^S 0 

femp ^ (word at 5 + 'rfjsp mod 2^*); 

(word at 5 + 'disp mod 2*«) ^ r + (word at 5 + 'disp mod 2'"); 
r temp 

{where Wwp € (0 . . . 524287], 'sdisp = 'disp/8} 
(INTJETCH^DDJNDEX r^y) ... r 5 F ... 00 y 10 38 MC 

«« 61 it 41 47 21 16*11 to 

temp — (word at 5 + 8 » y mod 2**); 

(word at 5 + 8 * y mod 2"*) - r + (iord at ^ + 8 * y mod 2^«); 
r /cmp; 

These operations generally behave like a LOAD operation Mowed by a STORE operation with 
respect to access control. If ac is present, it is used; otherwise the access control field of s is used. 



RAISES 

data_hw.error. data.prot, data-alignment, data_blocked 
COUNTS AS 

CNTiNTJ"ETCH_A.DD 
SEE ALSO 

LOAD, INT_^DD 
INTJ"ETCH«ADD_ 
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(INT JMM t value) ... t value 02 . A 

1 — value 

{where 'value € (-2" . . . 2'-' - 1]} 

This operation loads a signed immediate constant into register t. 



RAISES 
(nothing) 

SEE ALSO 
BIT3IASK 



Integer Operations 



INTJMM 
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(INT_LOADB r s) 

r - sign exiend(byte at s), with FEJ^ORMAL 
(INT_LOADB^C_DISP r 5 ac rfwp) ^ , 

r - sign extend(byte at s + 'disp mod 2*^), with 'ac 
{where 'disp € (0 . . . 16383]} 

(INiaOADB^CUNDEX...,, ..- ..V./. ,,«c.,.lS.30. MC 

r - sign extend(byte at 5 + y mod 2«), with 'ac 

(INTJLOADBJDISP r 5 cfwp) 

r - sip extend(byte at s + Wi^i, mod 2«), with FEJfORMAL 
{where 'dwp 6 [0 . . . 524287]} 

(INT J-OADB JNDEX r 5 y) r 

r - sign extend(byte at 5 + y mod 2"*), with FEJ^ORMAL 



RAISES 

data_hw.error, data.prot, data^gnment. data.blocked 
COUNTS AS 

CNTJ.OAD 
SEE ALSO 

STORES, UNSaOADB 



INTJ^OADB. 
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(INT XOADH r 5) ... r s 1 ... M 

^ 64 61 56 51 47 0 

r ^ sign extend(half\vord at s). with FEJCORMAL 
(INTJLOADH^CJDISP r 5 ac (fispj ... r 5 C ... ac sdispS MC 

^ 64 61 56 ii 4: 2J J6 ^4 0 

r — sign €Xtend(half\vord at 5 -h 'disp mod 2^®), ^vith 'ac 
{where 'disp € [0 . . . 16383]. 'sdisp = '(fzsp/4} 

(INT-LOADH-ACJNDEX r 5 ac v) ... r 5 F ... ac u 12 39 MC 

64 61 46 51 4: 21 16''ll 6 0 

r — sign €xtend(halfword at 5 + 4 * y mod 2^®). with 'ac 
(INTJLOADHJDISP r 5 cfz5p) ... rsC... sdisp B MC 

64 61 56 51 47 21 ^4 0 

r — sign €xtend(halfword at 5 -f mod 2^^), with FEJ^ORMAL 
{where 'disp 6 (0 . . . 524287]. •sdi5p = . 'disp/4} 

(INTJLOADH JNDEX r 5 y) ... r 5 F ... 00 v 12 38 MC 

64 61 56 51 4: 21 16^11 6 0 

r — sign extend(halfword at 5 + 4 ♦ y mod 2^^®), with FEJ^ORMAL 

These operations load a signed halfword from memory. The fe.control is taken from the ac field if 
present, or forced to FEJvORMAL. If ac is present, its forward, data trapO. and data trapl disable 
bits are used; otherwise those of s are used. 



R-^ISES 

dataJiwjerror, data-prot, data-alignment, data-blocked 
COUNTS AS 

CNTiOAD 
SEE ALSO 

STOREH, UNS J-OADH 



Integer Operations 



INTJ.OADIL 
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(INT J.OADQ r 5) 

r - sign extend(quarterword at s), with FEJJORMAL 

r - sign extend(quarterword at 5 + 'disp mod 2*^), with 'ac 
{where 'disp € (0 . . . 16383], 'sdisp = 'disp/2} 

(INT_LOADQ^C JNDEX r s ac y) • • • r 5 F . . . ac y 14 39 MC 

t* 41 it SI «T 21 16*11 4 0 

r Sign extend(quarterword at 3 + 2 * y mod 2*^), with 'ac 
(INT J.OADQ JDISP r . <f„p) . . . r . C . . . ,<f£5p 4 MC 

»4 61 it il 4T 21 "0 

r - sign extend(quarterword at 5 + 'disp mod 2'*«), with FE -NORMAL 
{where 'disp € [0 . . . 524287], 'sdisp = 'disp/2} 

(INT_LOADQ JNDEX r 5 y) ... rsF ... 0 y 14 38 MC 

64 41 i4 41 4 7 21 14*11 4 0 

r - sign extend(quanerword at 5 + 2 • y mod 2"*), with FE_NORMAL 

These operations load a signed quarterword from memory. The fe^ontrol is taken from the ac 
field if present, or forced to FE_NORMAL. If ac is present, its forward, data trapO, and data trapl 
disable bits are used; otherwise those of s are used. 



RAISES 

dataJiw^rror, data-prot, data-alignment, data-blocked 
COUNTS AS 

CNTJ-OAD 
SEE ALSO 

STOREQ, UNS.LOADQ 

INT.LOADQ- 
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(INT-LOGB r y) 

X — unbiased exponent of float y 



64 



. . . ^ y 

1 21 ic'n 6 0 



. X y OB 00 C 



(INTJLOGB.TEST x y) 



... X y OB 01 C 



64 



X — unbiased exponent of float y 

These operations determine the floor of the base 2 logarithm of a floating-point number. 

For denormaiized y, x will take on values less than the minimum exponent so that scalbfx, -logb(x)) 
is always less than two and greater than or equal to one. When y is infinity or NaN, the maximum 
positive integer is returned. When y is zero, the minimum negative integer is returned. 

INT-LOGB_TEST never generates overflow/NaN, and generates carry- if y is infinity, NaN or zero. 



RAISES 



(nothing) 



Integer Operations 



INTJ.OGB. 
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essor Principles of Operation 



^ ... < u u IF OE ... A 

t — max(u, v), integer 

(INT^AX.TEST r « u) ... t u . if OF A 

«< 47 42 jr 32 37 21* "o 

i — ma.\(u, v), integer 
These operauons select the larger of the two integer operands. 

INTJVIAX.TEST never generates overflow/NaN, and generates carry if u is selected, meaning 



IL\ISES 

(nothing) 
SEE ALSO 

INT-MIN, SELECTJNT, FLOAT-MAX 

INT-MAX. 
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(INT JvlEM_ADD«AC JDISP r s ac disp) ... r $ ? ac sdisp 13 MC 

(word at 6 + 'disp mod 2*'^) — r -h (word at 5 + 'disp mod 2"**). with 'ac: 

{where 'disp € [0 . , , 16383), 'sdisp = 'disp/S} 
(INT JVIEM_ADD^\C JNDEX r 5 ac y) ... r 5 F ... ac y 13 39 MC 

64 61 56 SI 4: 21 le'n 6 0 

(word at 5 + 8 * y mod 2^^) - r + (word at 5 + 8 * y mod 2^^), with 'ac; 
(INT«MEM-ADD JDISP r 5 (f:5p) ... r^F ... 5di5p 12 MC 

64 61 S6 61 47 21 ^5 0 

(word at 5 + 'disp mod 2^^®) — r -h (word at 5 + 'disp mod 2^^); 

{where 'disp e (0 . . . 524287], 'sdisp = 'disp/8} 
(INTJ^EM^DDJNDEX r s y) ... r 5 F ... 00 y 13 38 MC 

64 6] i6 il 4: 21 16*21 6 0 

(word at 5 + 8 » y mod 2^^) — r + (word at 5 + 8 * y mod 2^®); 

These operations generally behave like a LOAD operation followed by a STORE operation with 
respect to access control. Unlike INTjrETCH.ADD, the r register is not modified. If ac is present, 
it is used; otherwise the access control field of s is used. 



RAISES 

dataJiw.error, data.prot, data alignnient, data.b}ocked 
SEE ALSO 

LOAD, INTJIDD 

Integer Operations ^ INT-MEM Ji.DD. 
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(INT_MIN t u v) 

<-niin(u,r), integer " '''''''' 

(INT_MIN.TEST t u v) 

t u f IE OF ., 



t — min(u, v), integer 



♦ T <2 37 32 27 2\"q 



RAISES 

(nothing) 
SEE ALSO 

INTJVIAX. SELECTJNT, FLOAT-MIN 
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(INT JIECIP^PPROX xy) . . . z y 08 00 C 

U 21 16 n 6 0 

I — an approximation to l/y. floating point 

This operation is used to compute an integer reciprocal. It performs a table lookup operation, fol- 
lowed by a linear interpolation using an adder-multiplier. The result is returned as a SpecialFloat64. 
If the y operand is zero, the float.extension exception is raised. 



RAISES 

floatjextension 
SEE ALSO 

FLOATJLECIPJ^PPROX, §12.6 

Integer Operations . INTJIECIP-APPROX- 
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(INT_RECIP_ERROR t v w) 

t-l.O-v tt\ floating point, round to nearest 



... t IC V u; 00 . 

« *r «2 JT 33 2T 21 



This operation is used to perform integer division Thp PPPno 

Newton's method iteration us.ng the Xr^n^i^^^^^^^ "^.^ '^'^^ 

a SpedaIFloat64 and t is returned as a Float64. ' ""'^"^ 



RAISES 

(nothing) 
SEE ALSO 

INTJ)IV.CHOP,INTJ)r\'J"LOOR, INTJIECIP.^PPROX, §12.6 
INT_RECIP^RROR 
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(INT-RECIP^HIFT x y) . . . r y OE 00 C 

64 21 16 li e 0 

X — log2ab$(y), round to ceiling 
(INT JIECIP^HIFT.TEST i y) . . , r y OE 01 C 

64 2J IC^II 6 0 

X log2abs{y), round to ceiling 

These operations are used to compute integer reciprocals. They compute the ceiling of the base 2 
logarithm of the absolute value of y. When y is zero, x is set to -1. 



RAISES 

(nothing) 
SEE ALSO 

INTJ)IV.CHOP, §12.6 

Integer Operations , INT JRECIP-SHIFT- 
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(INT_RSQRT^PPROX r u) 

■ • • r y 09 00 C 
X - an approximation to 1/^. floating point " * " 

a uli?il'f ° "''^ '° ''^^ ^'^'P™^^ ^'^""^ of a denormalized number It performs 

=HEr="— ^^^^^^^^ 



RAISES 

(nothing) 
SEE ALSO 

FLOATJlSQRT«A.PPROX. §12.5 

INTJISQRT-APPROX- 
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(INT3HIFTJIIGHT < V u-} ... t ik v w 00 A 

t — V>atV 

(INT_SHIFT_RIGHT i y :) ... x y :1E C 

64 2] 16 11 e 0 

(INT^HIFT_R.IGHT_TEST i I- u) ... i lA i- u; 01 . A 

64 «7 *3 J7 32 27 21 0 

t — i»au; 

(INT_SHIFT_RIGHT_TEST r y .-) ... ary.-lF C 

64 21 16^11 6 0 

These operations do an arithmetic shift right, filling bits on the left with copies of the sign bit. 
Unsigned shift counts in w/z are taken modulo 64. 

The -TEST version generates carry if a 1-bit is shifted out of v or y and never generates over- 
flow/NaN. 



RAISES 

(nothing) 
SEE ALSO 

UNS^HIFTJUGHT, SHIFT-PAIILRIGHT, SHIFT-LEFT, INT J)IV JLOOR 
Integer Operations , INT-SHIFT JUGHT- 
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cesser Principles of Operation 

(INT^UB t u v) 

... u v ID OE ... A 

^ . 64 47 42 37 32 2T 21 0 

t ^ u - V, integer 
(INT^UB X y z) 
X - y - r, i] 

(INT-SUB.TEST t u v) 
t ^ V, integer 

(INT^UB.TEST x y z) 



^ . - 64 2J le'ji 6 0 

X — y - r, integer 

. / u i; ID OF A 

. . 64 47 43 37 32 27 21 0 

t u - r, integer 

... z y 227 C 

64 21 16^13 6 0 



X ^ y - 2:, integer 
These operations do two^s-complement integer and unsigned subtraction. 

The resulting condition code from the .TEST version of this operation has its twoVcomplement 



RAISES 

(nothing) 
SEE ALSO 

INT^UB JMM, UNS^UB_CARRY.TEST 



INT^UB. 
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(INT^UB JMM t u value) ... t u l lvalue 20 . . A 

t — u — 'value, integer 

{where 'value € (1 . . . 512]. 'bvalue = 'value - 1} 

(INT^UBJMM r y fa/ue) . . . ^ j, bvalue 06 C 

X — y — 'value, integer 

{where 'value € (1 . . . 32), 'bvalue = 'value - 1} 

(INT-SUB JMM_TEST / u value) ... t u l bvalue 21 A 

*« 47 4: 37 36 ST 21 0 

t — u — 'value, integer ■ 

{where 'value 6 [1 . . . 512], 'bvalue = 'value - 1} 

(INT_SUB J[MM_TEST X y ra/ue) ... x y bvalue 07 C 

64 21 16^11 6 0 

2 y - 'value, integer 

{where 'value € (1 . . . 32], 'bvalue = 'value - 1} 

These operations effectively subtract a constant between 1 and 32(512) to y{u), storing it in x(t). 

The resulting condition code from the .TEST version of this operation has its two's-complement 
definition. 



RAISES 

(nothing) 
SEE ALSO 

INT^UB JMM, INT-IDD JMM 

Integer Operations , INT-SUB JMM. 
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(INT_SUB_MUL tuvw) 
(INT^UB JVIUL.TEST t u v w) 

' ^ ' t u V w 2B . A 
<-u-i,*tt-,inieger " " 

(INT^UB_MUL_REV r « „ «.') 



. . . ^ u r li; 2E A 

« - -« + u * integer " " " " " »r «• • •, A 



^ . . . ^ « r ti; 2F A 



(INT^UB J^UL JIEV.TEST t u v w) 

integer 

These operations do .woVeomplemen. multipBcaio. follo.«i bv subtraction 

,t tf i'^ti;rortb: I'rjz^™ - '-'^^ 

If u or u; is outside f— 2*^ 9*3 « . 

sioei - ■••2. - IJ, the float.extension exception is raised. 



RAISES 

float .extension 
SEE ALSO 

INT^UB, INT_\DD«MUL 



INT-SUB_MUL. 
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(JUMP mask cn tn) ... mask cn F I S tn C 

64 21 13 11 • 6 3 0 

if CVfn £ 'mask then 
ssw.pc — TN: 

end 

(JUMP-OFTEN mask cn tn\ ... mask cn F 07 tn C 

64 21 :j 11 : 5 3 0 

if CVcn € 'mask then 
ssw.pc — TN; 

else 

stop lookahead: 

end 

( JUMP^ELDOM mask cn tn) ' ... mask cn F 17 tn C 

64 21 13 U r 6 3 0 

if CV'cn € 'mask then 
SSW.pc — TN; 
stop lookahead; 

end 

These operations do conditional branches if the selected condition code is a member of the condition 
mask. If taken, the branch goes to the location previously loaded into target register TN by a 
TARGET operation. The value mask is an eight-bit condition mask of type CondMask, described 
in §4. If mask is empty, then the test is false. 

The trap mask ssw.tm, mode bits ssw.md, and condition vector cv are unaffected. 

A JUMP-OFTEN operation is intended for branches that will be taken frequently, such as the 
backwards branch in a loop or a branch over an error condition. Lookahead is stopped if the 
JUMP.OFTEN fails. Lookahead is stopped if the JUMP^ELDOM succeeds. 

JUMP.OFTEN and JUMP^ELDOM can be monitored via CNTJUMP J:XPECTED and CNT.- 
JUMP.UNEXPECTED to observe branch prediction accuracv. JUMP onlv counts toward CNT.- 
TIL\NSFEILTOTAL. 



RAISES 

(nothing) 
COUNTS AS 

CNTJUMP J:XPECTED if expected path taken, CNT JUMP-UNEXPECTED if unexpected path 
taken, CNT.TILA.NSFEILTOTAL 

SEE ALSO 

SKIP 



Jump Operations 



JUMP- 
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-cesser Principles of Operation 



(LEVEL_ENTER lev) 

if (LEVEL = 'lev) then 

LEVEL — program map execute protection level: 



... 00 01 /er 0 0 6 0 

Jl 1« 13 U 7 6 J 0 



else 



end 



SSW.ssw.override — true: 
suppress program protection exception 

raise program protection exception 



** 21 16 13 11 T « 3 0 



(LEVEL_RTN lev tn) 

if LEVEL > 'lev then 
ssw.pc — tn; 

SSV/. ssw^verride — false; 
LEVEL — 'lev, 

else 

raise privileged operation exception 

end 

These operations change the privilege level of a stream (see §8.1) The LEVEL_ENTETl on^r,ti«n 

rJsed wtther^he onV^wf I T '. 7'°°' '"'"'^^ '''''' ' --P^-^ 

or not* a LEVElIt^^^^^^^ '^"''''^ P'^^ram map execute protection level 

be r^se^ ^^^^^-^^^ "^^°^P^^ ^° Privilege level, a privUeged operation exception will 

^ml-TEtstTj ^,^^^,^-^NTER sets ssw.override. However, lookahead beyond a 
RTN • '° I-°°kahead is not disabled for LEVEL.- 



RAISES 

privileged, prog.prot 
COUNTS AS 

CNTJ.EVEL if LEVEL J:NTER 
SEE ALSO 

DOMALVJ,EAVE 



LEVEL. 
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(LOAD r 5j ... r 5 4 ... M 

r - (word at 5), with FE«XORMAL 

(LOAD^CJDISP r 5 ac d25/)) ... r 5 D ... ac sdisp 11 MC 

64 61 i6 51 4: :i 16 5 0 

r — (word at 5 + 'disp mod 2"*^). with 'ac 

{where "rfwp € [0 . . . 16383), 'stfzsp = 'disp/S} 

(LOAD-AC JNDEX r 5 ac y) ... r 5 F ... ac y 08 29 MC 

64 61 56 51 47 21 16''ll 6 0 

r — (word at 5 -h 8 ♦ y mod 2"*^). with 'ac 
(LO AD JDISP r s disp) ... rsd ... sdisp 10 MC 

64 61 56 51 47 21 '^S 0 

r — (word at 5 + '^fzsp mocf 2^^). with FEJs'ORMAL 
{where 'disp € [0 . . . 524287], ^sdisp = 'disp/8} 

(LOAD JNDEX r 5 y) ... r s F . . . 00 y 08 38 MC 

64 61 56 51 47 21 16 11 t 0 

r — (word at 5 + 8 * y mod 2^®), with FE-NORMAL 

These operations load a word from memory. The fe.control is taken from the ac field if present, 
or forced to FEJS'ORMAL. If ac is present, its forward, data trapO, and data trapl disable bits 
axe used: otherwise those of 5 are used. They are used to load floating-point numbers and 64- bit 
signed and unsigned integers. 



RAISES 

dataJiwjerror, data.prot, data alignment, data.blocked 
COUNTS AS 

CNT-LOAD 
SEE ALSO 

STORE 



Jump Operations 



LOAD. 
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(LOAD_FE r s) 

... r s 0 ... M 
(word at.) " """" ' 

This operation loads a word from memory, obeying the fe.control in the pointer. It is used to load 

fneTads'^r"^ " T ' V'' ''^'^ "'^^'^^'^ ^^'«eers. This operation allows synchroniz- 
ing loads to be performed without using explicit access control in the operation. 



RAISES 

dataJiw^rror. data-prot, data-alignment, data-blocked 
COUNTS AS 

CNTJ^OAD 
SEE ALSO 

STORE.PTR.SET»'VC 



LOADJE. 
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{LOGICAL J^LLONE t mask cn) ... t OB mask cn 00 A 

64 i7 42 3T 29 27 21 0 

i — if CVcn c 'mask then - 1 else 0 end 
(LOGICAL^LLONE-TEST / nz(25A' cn) ... r OB mask cn 01 A 

64 47 42 37 29 27 2l' " 0 

t — if CVcn c 'mask then - 1 else 0 end 

These operations convert a condition code into a logical value in all the bits in t. In contrast, 
LOGICAL-ONE produces a single-bit logical vaJue. 

The fields mask and cn together refer to cVcn, a condition code that was previously generated. The 
value mask is an eight-bit wide condition mask of type CondMask, described in §4. 

The resulting condition code from the .TEST version of this operation has its two's-complement 
definition. 



RAISES 

(nothing) 
SEE ALSO 

LOGICAL.ONE, SELECTJNT 



Logical Operations 



LOGICAL_\LLONE. 



136 



s-essor Principles of Operation 



. . .. J OA mask cn 00 A 



A 



(LOGICAL.ONE / mask cn) 

' - if cv„ c VnaiA- then 1 else 0 end 
(10GICAI,.0NE.TEST , oA nu^ „ oi 

. . «« 47 42 JT 29 27 2l'"o 

t - If cven 6 -mask then 1 else 0 end 
i.u^jiL<Ai.J\.LLONE produces a full-word logical value. 

The fields mask and cn together refer to cv,„, a condition code that was previously generated The 
value mask :s an eight-bit wide condition mask of type CondMask, described in §4 

defil^tion!'''^ -^^^^ "^""'"^ twoVcomplement 



RAISES 

(nothiag) 
SEE ALSO 

LOGICAL-^LLONE, SELECTJNT 



LOGICAL.ONE. 
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(NOP) ... 00 00 6 ... M 

no action taken 

(NOP) ... 00 00 00 00 02 ... A 

64 47 i2 Z: 32 27 21 0 

no action taken 

(NOP) ... 00 00 19 00 C 

64 21 16 • 11 6 0 

no action taken 
These operations do nothing. 

The M-unit NOP is encoded as a UNS-LOADQ into register rO from register rO, The A-unit NOP 
is encoded as an INT JMM into register rO. The C-unit NOP is encoded as a CLOCK into register 
rO. 



RAISES 

(nothing) 
COUNTS AS 

M-unit NOP as CNT-MJsOP; A-unit NOP as CNTJIJJOP; C-uait NOP as CNT.C-NOP 
Probe Operations NOP 
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maplevel — level required for 'access at 5 -f 'disp: 
if (LEVEL < maplevel) and ( Vet; > level) then 

raise data protection level exception 
else if (mjn( 7ev. LEVEL) > maplevel) and (s has proper access control) then 
r — pointer to the last byte in the segment 

else 

r -0 

end 

{where 'disp 6 [0 . . . 16383], 'sdisp = 'disp/8} 
(PROBE JNDEX r s Uv access y) r 5 F ... lev access 0 y 00 39 MC 

«« 61 it il 47 21 19 le i«^n « 0 

maplevel — level required for 'access at 5 + 8 « y; 
if (LEVEL < maplevel) and ( 'lev > LEVEL) then 

raise data protection level exception 
else if (min(7ei;. LEVEL) > maplevel) and {s has proper access control) then 
r — pointer to the last byte in the segment 

else 

r -0 

end 

These operations are intended for checking the validity of address parameters passed from routines 
at lower protection levels to routines at higher protection levels. The protection level to check 
pnvilege is given by 'lev. It is specified by a member of the Level enumeration, such as LEV.USER 
(see §8.1). If the pointer s lies beyond the map limit for this domain, the map level is set to 
LEVJPL. The kind of access check given by 'access is one of the following codes: 

^ame Value Meaning 

ProbeControl 

P^EAD 0 check if the address is mapped for 

reading 

P-MODIFY 1 check if the address is mapped for 

modiUcation 

Besides checking whether the addressed location can be read or written at the given privilege level, 
these operations make sure that forwarding, trapping, and memory full bit testing are aU disabled 
in the pointer s. The pointer returned in r has the same access control field as s. 



RAISES 
data4)rot 

SEE ALSO 
DAT.A_-MAP. 



ProbeControl 
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(PROGRAM_CACHE_FLUSH . - . 0 0 u 00 04 OA . . . A 

64 47 44 42 27 :2 2' 21 0 

flush (program instruction at u) from program instruction cache 
(PROGRAM.CACHE_FLUSH^A.NY u] . . . 0 0 u 00 07 OA . . . A 

64 4T 44 43 37 :2 27 21 0 

flush any (program instruction at u) from program instruction cache 
(PROGRAM.CACHEJFLUSH_Ll u) . . . 0 0 u 00 05 OA . . . A 

64 47 44 42 37 32 27 21 0 

flush any (program instruction at u) from Ll program instruction cache 

These are supervisor-privileged operations to maintain consistency in the program instruction 
caches. 

The address in ti is a physical word offset into local memory. The PROGR-\M-CACHE-FLUSH 
operation is used to flush a single page of instructions, as after storing new data in a page frame. 
The PROGRAM_CACHE_FLUSH«A.NY operation is used to flush any entries from the caches, 
presumably only during system initialization. Since the cache is not fully associative, multiple 
flushes may be required-each flush should specify a different cache line. See §7.2. The PROGRAM^- 
CACHE-TLUSH-Ll operation flushes the Ll cache only. This restriction allows multiple pages to 
be flushed from Ll more quickly without disturbing the L2 cache. 



RAISES 

privileged 
SEE ALSO 

PROGRAM-MAP- 

Program Map Operations ^ PROGRAM-CACHE- 
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(PROGRAM_MAP_FLUSH u) 

' .0 0 " 00 02 OA A 

flush (program map ac .) fr<m> program map cache .■ ' 

(PROGRAM31AP JLUSH^Y .) ... 0 0 „ 00 03 0, A 

flush auy (program map a, „) from program map each;* ' " " " ' ' 

la«ou ca^r'™""'**'' " - P-S'- a^^-s trans- 

Sp PI tVch ™« > ""SP limit exception. The PKOGRAM ■ 

r - - -^ ^^^^^^^^^^ stjli^ t 

required, each flush should specify a different page modulo 128. See §7.1. 



RAISES 

privileged 
SEE ALSO 

DATA_MAP. 

PROGR.\.\U\IAP. 
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(PROGRAM3TATEJEIESTORE u) ... 0 0 u 00 00 OA ... A 

64 4T 44 43 27 Z7 27 Zl 0 

(program state descriptor for domain in u-63 - 60) — u 
This supervisor-privileged operation is used to set the program state descriptor. 

The register u contains a program state descriptor: see §8.5. The Bits 63-60 of u specify the 
protection domain, i.e. the program state descriptor is self-tagged. 



RAISES 

privileged 
SEE ALSO 

DATA^TATE. 



Pointer Operations 



PROGRAM^TATE- 
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(PTR^ET^C tuac) 

• lA ac 08 A 

(PTR^ET_AC xyac) 

.... z y ac 20 C 
I - y, with 'ac '* » * 0 



RAISES 

(nothing) 
SEE ALSO 

LOADJ-E.STORE 

PTR_SET. 
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(REAL_FLOAT t v) 



64 



4: 42 2' Z2 77 



t 00 V 19 OE 



2i 



0 



A 



t — rounded float t 

This operation is used to do rounding conversions from 64-bit floating point 10 32-bit floating point. 
The inverse of this operation is FLOATJIEAL. 

If V is NaN, t is that NaN after rounding the least significant bits from the significand and or'ing 
in a one at bit 3 to preserve NaN identity. 



RAISES 

float^verflow, float.underflow, float Jnexact 
SEE ALSO 



FLOATJIEAL 



Pointer Operations 



REAL- 
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r — (word at 5 + 'disp), with 'ac; 
(poison at V) — (full at 5 -f 'disp) 

{where 'disp € [0 . . . 16383], 'sdisp = ^15^/8) 

(REG_LOAD_AC JNDEX r s ac y) ... r 5 F ... ac y 09 39 MC 

r <^ (word at 5 + 8 ♦ y), with ac: 
(poison at V) (full at 5 + 8 ♦ 2/) 

(REG J)ISP r . <f«p) . . . r . D . . . sdisp 00 MC 

r «- (word at s + 'disp) 
(poison at V) — (full at 5 + 'disp) 

{where 'disp € (0 . . . 524287], 'st/zsp = 'disp/S} 

(REG J,0 AD JNDEX r . j,) ... r . F ... 00 y 09 38 MC 

«4 61 56 SI 47 J J 16*11 6 0 

r — (word at s + 8 ♦ y) 

(poison at V) — (full at 5 + 8 * y) 

These operations load the word and the memory fuU bit of the access state (§6.1) from the addressed 
memory ceU. The word is stored in register r and the memory full bit is stored in the poison bit 
for register r. 

These operations are only subject to the trapping and forwarding normally controUed by the access 
state of the addressed memory location. If ac is present, its forward, data trapO and data irapl 
disable bits are used: otherwise those of s are used. 



R-'MSES 

data_hw.error, data_prot, data_alignment, data-blocked 
COUNTS AS 

CNT_LOAD 
SEE ALSO 

REG^TORE. REG-MOVE 

REG-LOAD. 
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(REG-MOVE / f) ... f 09 r GO 00 ... A 

^ e4 i' 42 3T 32 2: 21 0 

t — v; 

(poison at 't) — (poison at V) 
(REG JMOVE xy) . . . x t/ 02 00 C 

64 21 16 11 6 0 

(poison at z) *- (poison at y) 
These operations copy data froni one register to another, without raising a poison exception. 



RAISES 
(nothing) 



Pointer Operations 



REGJ^OVE. 
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MC 



(REG^TORE^^C J)ISP r s ac disp) r c p 

(word at 5 + disp) - r. with 'ac: 
(full at ; + 'disp) - (poison at V) 

{where yz^p g [o . . . 16383]. Wzsp = 'disp/8} 

(REG^TORE^CJNDEX r s ac v) 

(word at 5 + 8 . y) - r, with 'ac: " »» . o 

(full at 5 + 8 * y) - (poison at V) 

(REG^TORE_DISP r 5 t/wp) 

, • • • r s F . . , 5<ftsp 00 MC 
(word at . + 'cf«p) - r " """" " * « 

(full at 5 + 'disp) - (poison at V) 

{where 'rfwp € (0 . . . 524287], 'sdisp = 'disp/8} 

(REG^TORE JNDEX r s v) 

• • . r 5 F . . . 00 w 01 38 MC 
(wordat. + 8*y)-r " """" " * ' 

(full at s + 8 . y) - (poison at V) 

Ifd'^lT"'""' "°n "'""^^^ ^'^ °f ^««" state(§6.1) and the value in the 

the e on ,T ""^ v^''" ^°=^Pl«">««in6 the action of a REG-LOAD operation Thus 

these operations are not subject to a poison exception due to r. operation. iHus, 

s^aL'Tr^'SS' '"^j'" '° '^^PP»°S °^ forwarding normallv controUed by the access 

Slfb t it H 7""°'' ^ d«a trapO and data Tr^pl 

disable bits are used: otherwise those ofi are used. trapi 



RAISES 

dataJiw.error, data.prot, data^gnment. data.blocked 
COUNTS AS 

CNT_STORE 
SEE ALSO 

REG-LOAD. REG-MOVE 



REG-STORE. 
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(RESULTCODEJSAVE 2) ... r 00 ID 00 C 

64 2: 16 :: 6 o 

X — RESULTCODE^ lookahead index 
RESl'LTCODE — 0 

This operation saves and clears the result code register, described in §9.1. The value of RE- 
SULTCODE is undefined after an instruction combining RESULTCODE^AVE with a floating-point 
A-operation. The data resultcodes are rotated so that drO corresponds to opaO. 



RAISES 

(nothing) 
SEE ALSO 

EXCEPTION-, DATA.OPA^AVE 

Rotate Operations ^ RESULTCODE. 
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cesser Principles of Operation 



(ROTATE-LEFT z y z) 



64 21 16*11 6 0 



(ROTATE_LEFT.TEST x y z) . . . x u z IB C 



(ROTATE-RIGHT i y z) 
X - ywr 



... r y r02 C 



(ROTATE JIIGHT.TEST x y z) ... x y zOZ C 

64 21 16*11 6 0 

X — y^z 

These operations rotate a word to the left or right. They compute the unsigned rotation amount 

z modulo 64. 

The .TEST version generates carry if a l-bit is rotated out of one end of y and into the other end, 
and never generates overflow/NaN. 



* 



RAISES 

(nothing) 
SEE ALSO 

SHIFT_LEFT, INT^HIFTJUGHT, UNS^HIFTJIIGHT, SHIFTJAIR 
ROTATE. 
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(SELECT-FLOAT t u v floatselect cn) ... t u v JJoatselect cn 06 ... A 

61 i: 43 37 2T 79 27 Zl 0 

t — if C\'cn € 'floatselect then u else v end 
(SELECT-FLOAT-TEST f li i-yjoarse/ec/ cn) ... t u v floatselect cn 07 ... A 

64 4T 43 37 2T 29 37 :: 0 

t — if CVen 6 'floatselect then v else r end 
(SELECT JNT / u v intselect cn) ... t u v intselect cn 04 .. . A 

64 47 42 37 32 29 27 22 0 

t — if CVcn € 'intselect then v else r end 
(SELECT JNT-TEST f ti r znt5e/ecf cn) ...fur intselect cn 05 A 

64 47 42 37 32 29 27 2: 0 

. f ^ if CVcn 6 ^intselect then u else r end 
These operations are used to conditionally select the value u or r. 

The masks intselect and floatselect describe the values of the condition in cv^n that will select u 
rather than r: see §4. 

The condition test intselect may be one of SEL.CY, SELJEQ, SELJGT, SELJGE. SEL.UGT, 
SEL-UGE. SEL JPL, or SEL JPZ. Reversal of m and v effectively vields the additional tests SEL.- 
NX. SEL-NT. SELJLE. SELJLT. SEL.ULE, SEL.ULT. SELJMZ, and SELJML The condition 
test floatselect may be one of SELJLT. SEL-FGE, SELJGT, SELJLE, or SEL JUX. Reversing 
u and V yields additional nameless conditions. Selection based on floating-point equalitv may use 
a SELECTJNT operation with SEL JQ. 

These operations are "lazy"" in that Poison in the selected value is merely propagated to the desti- 
nation. Hence, no exceptions are raised if either value is poisoned. 

The _TEST versions of these operations never generate overflow/NaN or carry. 



RAISES 

(nothing) 
SEE ALSO 

INTJvlAX, INT-MIN, FLOAT-MAX, FLOATJiIIN, BIT-MERGE 



Shift Operations 



SELECT. 
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(SHIFT_LEFT x y z) 

X — y<z 



... X y zl8 C 

H 21 It'll to 



(SHIFT>LEFT-TEST x y z) ... x y zlS C 

64 21 16^11 6 0 



A 

64 47 42 37 33' 27 21' " 0 



(SHIFT J^EFTJMM tush) ... t u 8 sh 08 a 

«4 4T 43 3T S3 IT Jl 0 

t — ti< 'sh 

{where 'sh € {0...63]} 

(SHIFT_LEFT_IMM.TEST tush) ... ..t.^«..8 .^A 09 

{where 'sh£[0... 63]} 

These operations shift words to the left, filling vacated positions on the right with 0-bits. Unsigned 
shift counts in z are taken modulo 64. 

The TEST version generates carry if a 1-bit is shifted out of u or y, and never generates over- 
flow/NaN . 



RAISES 

(nothing) 
SEE ALSO 

UNS_SHIFT jllGHT, INT^HIFTJUGHT, SHIFTJ>AIR, ROTATEJLEFT 
SHIFTJ.EFT. 
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(SHIFT_PAIR_LEFT / u f u) ... J u r 14 ... A 

64 «: 42 zr 22 2: 21 0 

t — (the pair {u. rXu')/2^ 
(SHIFT J>AIRJ.EFT.TEST ^ 1/ I- a-) . . . / u v u- 15 . . . A 

64 47 42 3: 32 2: 21 0 

i — (the pair (iz, v)^w)/2^ 
(SHIFT J>AIRJIIGHT t u v ir) t u v w 16 A 

64 4: 42 37 32 27 21 0 

t — (the pair (u, v):^'w)mod2^ 
(SHIFT_PAIRJIIGHT.TEST t u v w) t u v w 17 A 

64 47 42 37 32 27 21 * 0 

t ^ (the pair iu,v)'>w)mod2^ 

SHIFTJ^AIRJ.EFT shifts a copy of u left and fills vacated positions with bits from the left end of 
V, whereas SHIFTJ^AIRJUGHT shifts a copy of v right and fills vacated positions with bits from 
the right end of u. Unsigned shift counts in w are talcen modulo 64. 

The SHIFTJ^AIILLEFT.TEST version generates carry if those bits of u not appearing in i are not 
all 0, and never generates overflow/NaN. The SHIFT JAIR_RIGHT-TEST version generates carry 
if those bits of v not appearing in t are not all 0, and never generates overflow/NaN. 



RAISES 

(nothing) 
SEE ALSO 

SHIFT J-EFT, ROTATEJ-EFT, UNS^HIFTJUGHT, INT^HIFT JUGHT 
Skip Operations SHIFT-PAIIL 
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(SKIP mask cn offset) . . . ^nask cn hi 1 6 /o C 

if cv„ € 'mask then 

ssw.pc - ssw.pc -i- ( 'offset + 1) 

end 

{where 'offset € (0 . . . 119], 7o = 'offset mod S, 'hi = [ 'offset/8}} 
(SKIP.OFTEN mask cn offset) ... mask cn hi 0 7 /o C 

64 22 U II 7 « 3 0 

if c\cn € mc5t then 

ssw.pc — ssw.pc -r ( 'offset + 1) 

else 

stop lookahead 

end 

{where 'offset G [0 . 119], 7o = 'o#5cf mod 8, 'hi = L'o#5Ct/8J} 
(SKIP^ELDOM mask cn offset) . . . rnask cn hi 17 lo C 

64 31 13 11 ; 6 3 0 

if CV'tn g masilrthen 

ssw.pc — ssw.pc + ( 'offset + 1) 
stop lookahead 

end 

{where 'offset € [0 . . . 119], 7o = 'ojQ^^ei mod 8. 'hi = [ 'offset 

These operations do conditional forward branches if the selected condition code is a member of the 
condition mask. If taken, the skip skips the following offset instructions. The value mask is an 
eight-bit wide condition mask of type CondMask, described in §4. If mask is empty, then the skip 
always fails. 

SKIP.OFTEN and SKIP .SELDOM can be monitored via C NT JUMP-EXPECTED and CNT.- 
JUMP.UNEXPECTED to observe branch prediction accuracy. SKIP only counts toward CNT_- 
TRANSFER.TOTAL. 



RAISES 

(nothing) 
COUNTS AS 

CNTJUMP J:xPECTED if expected path taken. CNTJUMP.UNEXPECTED if unexpected path 
taken, CNT-TRANSFER.TOTAL 

SEE ALSO 

JUMP 



SKIP. 
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(SSWJDISP X o#seO x 0 0 hi 0 6 lo C 

64 :i U 13 11 ? 6 3 0 

X — 5SW -h 'offset -r 1 

{where 'offset e [0 ^ . A19]. 'lo = offset mod S, 'hi = ['offset /8\} 
(SSWJELESTORE li) ... 0 0 u 00 OA OA ... A 

64 4T 44 42 3T 32 27 21 0 

SSW.cr — u.cv 
ssw.im — u.fm 
ssw.md — li.md 

The SSW J)ISP operation is used to load a branch address into a general purpose register x, rather 
than a target register. Thus, the value may be later loaded into a target with TARGET-RESTORE 
prior to jumping to the location. It also returns the trap, mode, and condition fields of the SSW. 

The SSWJIESTORE operation is used to set the trap, mode, and condition fields in the ssw. The 
value of cv after an instruction combining SSWJIESTORE with a -TEST C-op is undefined. 



RAISES 

(nothing) 
SEE ALSO 

TARGETJIESTORE 



State Control Operations 



SSW. 
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(STATEXO AD JDISP r s d^sp) ... rsf ... sdisp 10 MC 

T — (access state at s + 'disp) 

{where 'disp 6 (0 . . . 524287], 'sdisp = 'disp/S) 

(STATE JLO AD JNDEX r s y) ... r 5 F ... 00 , 00 38 MC 

*< 61 46 J J «r Ji 16*11 6 0 

r — (access state at s + 8 * y) 

These operations load the access state(§6.1) from the addressed memory ceU, and convert the 
access state into an access control field(§6.1) stored as a pointer in registe'r r. Word alignment is 
not required; the byte select bits are ignored. 

These operations are not subject to the trapping, forwarding, or memory fuU bit waiting normally 
controUed by the access state of the addressed memory location. 

The pointer r is constructed as follows (the access control field is constructed by inverting the 
operations done by a STATE^TORE operation): 

• t^7l ^"'"^"^^ ^^^^ '^orward^nable'' in the access state) is placed in field 
Twd^sable of r. 

• A copy of data trap bit 0 (field "trapCenable" in the access state) is placed in both field 

trapO^tore-disable" and field "trapOJoad-disable" of r. 

. A copy of data trap bit 1 (field "trapLenable^ in the access state) is placed in both field 
trapljtore.disable" and field "traplJoad-disable" of r. 

• ^'^ ^^""^^ "^^«) is "s^d ^° construct the full/empty control 
rr ?vv?''f ^^'^^ '° FE J'UTURE if the memory full bit is true, and is set 

to FE-SYNC if the memory full bit is false. 

• Other bits of r are set to 0. 



RAISES 

dataJiw.error. data_prot 
COUNTS AS 

CNT_LO.A.D 
SEE ALSO 

STATE^TORE. STATE-LOCK 



ST.\TEJ-OAD- 
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( STATE JLO CK_ACJ5ISP r s flc fljsp) ... r s C ... ac sdisvOl MC 

^ ^' 64 61 56 53 4? :: 16 * 5 0 

r — (access state at s -f 'disp). v.-ith *gc: 
(access state at 5 + 'disp\ — (forwarded, empty) 
{where 'disp € [0 . . . 16383]. sdisp = 'disp/S) 

(STATE-LOCK^CJNDEX r 5 ac y) ... r 5 F ... ac y 1139 MC 

64 61 S6 53 4: 21 36^11 6 0 

r — (access state at 5 + 8 * y). with *ac: 
(access state at 5 + 8 * y) — (forwarded, empty) 

[STATZJLOCKJDIST r s dxsp) ... rsC ... sdispOO MC 

64 61 56 51 4: 21 '^S 0 

r (access state at 5 + 'disp)\ 
(access state at 5 + ^disp) — (forwarded, empty) 
{where 'disp € [0 . . . 524287], 'sdisp = 'disp/8} 

(STATEJ:.0CK JNDEX r 5 y) ... r 5 F ... 00 y 11 38 MC 

64 61 56 51 47 21 16^11 6 0 

r — (access state at 5 -h 8 » y): 

(access state at 5 + 8 * y) — (forwarded, empty) 

These operations allow atomic access state manipulation, with additional access control modifica- 
tion through the operand ac. The operation loads the access state(§6.1) from the addressed memory 
cell, and converts the access state into an access control field(§6.1) stored as a pointer in register 
r, as is done by a STATE-LOAD operation. The operation then sets the access state stored in the 
addressed memory cell to forwarded and empty. Word alignment is not required: the byte select 
bits are ignored. 

These operations are not subject to the forwarding or memory full bit waiting normally controlled 
by the access state of the addressed memory location, except that a location that is both forwarded 
and not full is considered locked. In this case, the operation fails and is retried later. However, 
data trap bits are observed as in a normal memory operation. K ac is present, its data trapO and 
data trapl disable bits are used: otherwise those of s are used. 

The STATE-LOCK operation allows a possibly unforwarded memor>- word to be forwarded in 
an indivisible manner, locking the word with an "empty forwarding pointer" access state, while 
retrieving its current value. 



RAISES 

dataJiw.error, data-prot 
COUNTS AS 

CNT-STORE 
SEE ALSO 

STATEJ-OAD, STATE-STORE 



State Control Operations 



STATE-LOCK. 
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-•ssor Principles of Operation 



(STATE^CRUB JJISP r s „sp) ... r s T ... siispOS MC 

asyn — (data syndrome at 5 -i- 'disp) 
asyn — (access syndrome at s -f 'disp) 
data — (word at 5 -i- 'disp) 
for!€[0...7]:r,-d5yn,- 
for I € [8 ... 11] : r, - a5yn,_8 
for ie[l2...63] :r, - data, 

{where 'disp € (0 . . . 524287], 5^,5;, = 'disp/8} 

(STATE^CRUBJNDEX r s y) , . p nn «n 

• • • r 5 F . . . 00 w 02 38 MC 

wyn — (data syndrome at 5 + 8 * y) 
asyn — (access syndrome at 5 + 8 * y) 
data — (word at s + 8 * y) 
for I € [0 . . . 7] : r, ~ dsym 

for z € (8... 11] :r. -a5yn._8 
forie [12... 63]: r, - data. 

I«red°mlmir/Jl f'""^^^ ^"'-""f ^^^^^ °f the ad- 

tZf^T7u they become uncorrectable multiple bit 

IZLlf^T V^^r ^"'"^^ examination of the syndrome bits. The combined 

syndrome bits for the data word and access state are returned. If there are no errors detected by 
the error-correction control logic, then these bits wiU be zero. ^ 
The appendix details the syndrome values returned. 

c^nTrouTd bv th'! u '"PP'^S' ^°^-*^di°6. or memory full bit waiting normally 

controlled by the access state of the addressed memory location. 



RAISES 

data_hw_error, data.prot. datajdignment, data.blocked 
STATE^CRUB. 
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(STATE_STORE_AC JDISP r s Gc az5p) ... r 5 E ... ac sdisp 01 MC 

64 61 &6 il 47 21 le 0 

(word at 5 -r 'disp) — r: 

(access state ai s -f 'c/z^pj — accesscontrolf^). with ac 
{where 'disp G [0 . , . 163S3j. 'sdisp = Wz^p/S} 

(STATE^TORE^CJNDEX r 5 ac yj ... r 5 F ... ac v 19 39 MC 

64 ej 56 51 4: 21 IC^ll 6 i 

(word at 5 + 8 « y) — r: 

(access state at 5 + 8 * y) — accesscontrol(5), with 'ac 
(STATE^TORE-DISP r 5 di5p) ... rsE ... sdisp 00 MC 

64 62 56 51 47 21 ^5 0 

(word at 5 + W5p) — r; 
(access state at ^ + 'disp) — accesscontroI(5) 
{where Wsp e [0 . . . 524287], 'sdisp = 'disp/ 8} 

• (STATE^TORE JNDEX r 5 y) ... r 5 F ... 00 v 19 38 MC 

(word at 5 + 8 » y) — r: 
(access state at 5 + 8 » y) — accesscontrol(s) 

These operations store both the access state(§6.1) and the value in the addressed memor>' cell. The 
value comes from register r. The access state comes from the access control field(§6.1)'of register 
s. inverting the encoding done by a STATE-LOAD operation. 

These operations are not subject to the trapping, forwarding, or memory full bit waiting normaJly 
controlled by the access state of the addressed memor>' location. 

The access state in the memory cell is constructed from the access control field of 5 as follows: 

o A copy of the forward disable bit (field ^wd_disable" in the access control) is placed in the 
forward enable bit, field "forward jenable", in the access state. 

o The logical OR of field "trap0.store.disable" and field "trapOJoad-disable" is placed in the 
-data trap 0" enable bit. field '"trapOjcnable", in the access state. 

o The logical OR of field "trapl^tore-disable" and field "traplJoad^sable" is placed in the 
"data trap 1" enable bit, fiei: "trapLenable", the access state. 

o The memory fuU bit, field •^ull^ is set if the full/empty control (field "fe.contror) is set to 
FE_FUTURE; with FE-SYNC, the memory full bit is cleared. The result is undefined if the 
field is set to FE_NORMAL. 

The exception is that if the ac (access control) operand is present, then the forwarding and data 
trap disable bits and full/empty control bits from ac replace those from 5. 

RAISES 

data_hw.error, data.prot, data-alignment, dataJ>locked 
COUNTS AS 

CNT^TORE 
SEE ALSO 

STATEJ,OAD, STATEJ^OCK 



State Control Operations 



STATE^TORE. 
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cesser Principles of Operation 



(word at 5 + 'disp) — r: 

(access state at 5 + ^disp) - corrected access state 
{where 'disp € [0 . . . 16383], 'sdisp = 'disp/8} 

(STATE^TORE J:RR0R JNDEX r s y) . . . r . F . . . 0 y 19 39 MC 

(word at 5 + 8 » y) — r: 

(access state at s + 8 . y) ~ corrected access state 

These operations store the value and conect the access state(§6.1) in the addressed memorv ceU, 
as long as there is a correctable error in the old value stored. The value comes from register'r. 

co^LZ'k^T '° '"PP'^S, forwarding, or memory full bit waiting normally 

controUed by the access state of the addressed memory location. 



RAISES 

dataJiw^rror, data.prot, datajJignment, data.blocked 

COUNTS AS 

CNT_STORE 
SEE ALSO 

STATE^CRUB 



STATE^TOREJERROR- 
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(STOREB ri) ... r s B ... M 

64 62 56 il 4- 0 

(byte at 5) — r 

(STOREB-A.CJDISP r 5 ac (izsp) ... rsE... acdisvS MC 

64 61 bt iJ 4: 22 16 * : 0 

(byte at s + 'disp mod 2^®) — r. with 'ac 
{where 'disp € [0 . . . 16383]} 

(STOREB-A.C JNDEX r s ac y) . . . r 5 F , . . ac y IE 39 MC 

64 61 &6 SI 47 21 le"^!! 6 0 

(byte at 5 + y mod 2^®) — r, with 'ac 
(STOREB JDISP r s disp) , . . r s E , . , disp 2 MC 

64 62 66 52 47 22 ^^2 0 

(byte at 3 + 'disp mod 2^®) — r 
{where 'disp 6 [0 . . . 524287]} 

(STOREBJNDEX r s y) ... r 5 F ... 00 y IE 38 MC 

64 62 56 52 47 21 16^22 6 0 

(byte at 5 + y mod 2"*®) — r 

These operations store a byte at the addressed location. If ac is present, it is used; otherwise the 
access control field of s is used. 



RAISES 

data-hw.error, data-prot, data^gnment, data_blocked 
COUNTS AS 

CNT^TORE 
SEE ALSO 

L\T_LOADB, UNS-LOADB 



Store Operations 



STOREB. 
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: issor Principles of Operation 



(STOREH r5) « 

(nalfword at 5) — r 

(STOREH^C J)ISP rsccdisp) ... rsE... ac sdisp 9 MC 

(halfword at 5 + 'c/zsp mocf 2''^) — r, with 'oc 
{where 'disp 6 (0 . . . 16383), 'stfwp = 'disp/4} 

(STOREH JICJNDEX r s ac y) ... r 5 F ... ac y lA 39 MC 

64 6> J6 SI 4T 2J 16*11 « 0 

(halfword at j + 4 « y mod 2^^) — r, with 'ac 
(STOREH JDISP r 5 rf.-5p) . . . r . E . . . sdisp 8 MC 

64 61 it SI 47 21 '^i 0 

(halfword at i + 'flfwp mod 2'*^) «- r 

{where 'dwp € [0 . . . 524287], 'sdisp = 'dt>p/4} 

(STOREHJNDEX r 5 y) ... , , ^ ... 33 

, «4 6J S6 SI 4r 21 16*JI « 0 

(halhv'ord at s + 4 • y mod 2"*) — r 

These operations store a halfword at the addressed location. If ac is present, it is used; otherwise 
the access control field of 5 is used. 



RAISES 

data_hw.error. data_prot, data-alignment, dataJilocked 
COUNTS AS 

CNT^TORE 
SEE ALSO 

INT-LOADH, UNSJLOADH 



STOREH. 
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(STOREQ rs) ... r 5 A ... M 

(quarierword at ^) — r 

(STOREQ-ACJDISP r 5 ac (/25p) ... r 5 E ... ac sdisp S MC 

64 61 56 il 4: 21 16 ^3 0 

(quarterword at 5 + 'disp mod 2^^) — r, with 'ac 
{where 'disp € [0 . . . 163S3], 5<f:5p = 'disp/2} 

(STOREQ^C JNDEX r s ac y) ... r s F ... ac y IC 39 MC 

^ " 64 61 56 51 4r 21 16^11 6 0 

(quarterword at 5 + 2 * y mod 2^^) — r, with 'ac 
(STOREQ-DISP r s di5p) ... rsE ... 5<ii5p4 MC 

^ 64 61 56 51 47 21 ^3 0 

(quarterword at 5 + 'disp mod 2^^) — r 

{where 'disp € [0 . . . 524287], 'sdisp = 'disp/2} 

(STOREQ JNDEX r 5 y) ... r 5 F . . . 00 y IC 38 MC 

64 61 56 51 4T 21 16^11 6 0 

(quarterword at 5 + 2 ♦ y mod 2"^^) — r 

These operations store a quarterword at the addressed location. If ac is present, it is used; otherwise 
the access control field of s is used. 



RAISES 

dataJiwjerror, data.prot, data^gnment, data.blocked 
COUNTS AS 

CNT^TORE 
SEE ALSO 

INTJ.OADQ, UNS-LOADQ 



Store Operations 



STOREQ. 
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.essor Principles of Operation 



(STORE r 5) 

^ ^ ... r 5 8 ... M 

(word at 5) — r 

(STORE^C JDISP rsacdisp) ... rsE... ac sdisp 11 MC 

(word at s + 'disp mod 2"^) - r, with 'oc 

{where 'disp € [0 . . . 16383], 'sdisp = 'disp/i) 

(STORE^CJNDEX r 5 ac y) r 5 F ... ac y 18 39 MC 

, , • «< 61 46 41 «T 21 16*11 6 "■'^ 

(word at s + 8 ♦ J/ mod 2^8) - r, with 'ac 
iSIOKEJ^lSV r s disp) ^ ... r s E ... sdisp 10 MC 

64 61 46 41 «r 21 ^^4 0 

(word at 5 + Ww;) morf 2*^) — r 

{where 'disp 6 [0 . . . 524287], 'sdisp = 'dwp/8} 

(STOREJNDEXrsy) • . • r . F . .. 00 y 18 38 MC 

/ J ^ 64 61 46 41 47 Jl 16*11 6 0 

(word at 5 + 8 « y mod 2"**) — r 

These operations store a word at the addressed location. If ac is present, it is used; otherwise the 
access control field of s is used. 



RAISES 

data_hw.enor. data.prot. data^gnment, data.blocked 
COUNTS AS 

CNT-STORE 
SEE ALSO 

LOAD, STORES 



STORE- 
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(STREAM-CUR.SAVE r) . . . / 08 00 02 00 . . . A 

^ 64 4: 42 3T 22 27 21 0 

t — SCURd 

(STREAMJDENTIFIER.SAVE 0 . . . / 08 00 01 00 . . . A 

^ 64 47 42 27 32 27 21 0 

/ — identifier of the executing stream 
(STREAM-LOOKAHEAD^AVE r) ... x 00 IF 00 C 

^ 64 21 16 11 6 0 

X — (lookahead index of the executing stream) * 4 
(STREAM-RES^AVE t) , , . t 08 00 03 00 . . . A 

^ . ^ 64 47 42 37 32 27 21 0 

t ^ SRESo 

The STREA\LCUR-SAVE and STREAM JIES^AVE operations respectively return the number of 
streams currently executing and the number of streams currently reserved in the stream's protection 
domain. 

The STREAM JDENTIFIER-SAVE operation is meant to help diagnose the stream management 
hardware. STREAMJDENTIFIER-SAVE returns the issuing stream's stream number that was 
assigned by the hardware when the stream was created. 

The STREAM J.OOKAHEAD-SAVE operation is used to read the three-bit lookahead lock counter 
index. It is used for mapping between the OPA and OPD registers of the M-unit cLnd the data 
result codes. Since the data result codes are four-bit values, the index returned is scaled bv four. 



RAISES 

(nothing) 
SEE ALSO 

STREAMJIESERVE, STREAMXREATE, STREAM.QUIT, §2 



Stream Operations 



STREAM. 
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cesser Principles of Operation 



(STREAM-CATCH r t x delay str) 



ssw.pc — newpc: 
ssw.md — newmd: 
ssw.tm — nexvtm: 
ssw.cy — 0; 
D newdomain; 
LEVEL ^ newlevel: 
T ^ data: 
t ^ data; 
X ^ data; 
TO — data; 
EXCEPTION is cleared; 
RESULTCODE is cleared; 
instruction counter — 0; 



generated by the hardware to complete the execution of a STRE.\M 
CREATE instruction. As such, IPL privilege is required to explicitly execute it. 



RAISES 

(nothing) 
SEE ALSO 

STREAM.CREATE 

STREAM.CATCH. 
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(STREAM-COUNTJNST x) 

J — (instruction counter) 



64 



21 16 :: e 



X 00 18 00 



0 



c 



(STREAMXOUNT JNST JIESTORE x y) 



. . . X y OA 00 C 

21 16^;: 6 c 



I — (instruction counter): 
(instruction counter) — y 

These operations manipulate the stream's instruction counter. STREAM-COUNT JNST returns 
the counter. STREAM»COUNTJNSTJIESTORE sets the instruction counter, which then counts 
down, stopping once it hits zero. When the instruction counter steps from one down to zero, the 
instruction count exception is raised. 

There is also an instruction issue counter for each protection domain for accounting and performance 
measurement. 



RAISES 

(nothing) 
SEE ALSO 

COUNTJSSUES, §10.1 



Resource Accounting Operations 



STREAM.COUNTJNST. 
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jcessor Principles of Operation 



(STREAM.CRE ATE JMM rtuzy offset) ... r 00 T t u lo OB x y hi OE MAC 

if scURip = SRES£) then 
raise create exception 

else 

SCVKo — SCVKo + 1; 

ssw.pc' — ssw.pc + 'offset: 
SSW.md' — ssw.mrf; 
ssw.^m' — ssw.fm; 
ssw.cr' ^ 0; 
d' — d; 

level' — LEVEL; 



TO' - TO: 

EXCEPTION' is cleared: 
RESULTCODE' is cleared; 
instruction counter' — 0; 

end 



{where 'offset € [-2" . . . 2" - 1], 7o = offset mod 1024, 'hi = [ b#5et/1024j } 

r^d!trr;r T^^^'" ^ ^^^^ instruction stream. The unprimed registers represent 

registers m the old stream. The primed registers represent registers in the new stream. 

!!l\V'''''°' T^^'' ^^^^^^-^^5 in the protection domain (SCUR,,) is less than the 

number reserved by pnor STREAM JIESERVE. operations (sres^), then the STREAM.CREATE 
succeeds, and state is copied from the current stream into the new stream. 

Lh>.t!f ? "'''"^^^ ^'^"^'^^ ^^*P These registers are 

fhe new !f J"' "'T: ^'^'^ g^^eral-purpose registers r, u, and y that are copied into 
The r. L'""" I'' '^^^ P^''^^"- ^ ^Sument pointer, and some stream iLtifier. 

T^Il ^- ^« safe state). 

The mstruction count register is set to zero, which will not cause a trap on issues in the stream. 

ST^Imc^tItt" """'^ ""'^ ' STREAMJIESERVE operation, can then created with a 
blKEAM.CREATE operation, and are then killed off with a STREAM.QUIT operation. 



RAISES 

create 
COUNTS AS 

CNT.CRE.A.TE 
SEE ALSO 

STREAMJIESERVE, STREAM.QUIT, §12.1 



STREAM.CREATE. 
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(STREAM.QUIT) ... 00 00 F 00 04 00 00 00 00 00 00 OA MAC 

64 61 &6 hi 47 42 J7 32 27 21 16 11 6 0 

SCVKp — SCVKp - 1: 
SRESp — SKZSd - 1: 
release trap registers: 
release OPA/OPD slots: 
stop execution; 

(STREAM.QUIT-PRESERVE) ... 00 00 F 00 04 00 01 00 00 00 01 OA MAC 

^ ^ ' 64 61 56 51 47 42 37 32 27 21 16 11 6 0 

scurd — scurd - 1: 
release trap registers; 
release OPA/OPD slots; 
stop execution; 

The QUIT operation is supervisor-privileged if field "priv.quit" is set in the program state descrip- 
tor. Privileged quit mode lets the operating system clear the stream's state before returning it to 
the hardware for reallocation (possibly to another protection domain). 

Lookahead beyond a STREAM-QUIT is forbidden. Streams which allow such lookahead will force 
the STREAM-QUIT to be retried, wasting issue slots. 

The QUIT-PRESERVE operation preserves the reservation for the current stream. 



RAISES 

privileged 
COUNTS AS 

CNT-QUIT 
SEE ALSO 

STREAM.CREATE, STREAM.RESERVE 



Resource Accounting Operations 



STREAM-QUIT 
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issor Principles of Operation 



(STREAM JIESERVE t u st) 

req — if 'st = 0 then u else 'st end 
t — if (re? < S - XI, SRES, - /zmio) 
and (reg < SLIMo - SRES/j) 
then Teq else 0 end; 
SRES£) — SRESd 4- f 

{where '5/ € (0 . . . 255]} 

(STREAMJIESERVE.TEST / u st) 

req — if 'st = 0 then u else 'st end; 
i — if (re? < S - E/SRES, - limbo) 
and (re? < SLIM/? - sreSd) 
then reg else 0 end; 

SKZSd SRES£) + t 

{where 6 (0...255]} 

(STREAM-RESERVE.UPTO i u sO .... u ^ v« ... ^ 

64 47 42 37 3i 27 21* " 0 

reg — if 'st = 0 then u else end; 

/ — niiii(reg,S - X], SRES, - /zmio, max(SLlMD -SRESi),0)); 

SRESxp — SRESo + i 

{where si 6 (0 . . .255]} 
(STREAM-RESERVE-UPTO-TEST f ti 50 ... i iz 1 5i 09 ... A 

64 47 42 37 3& 27 31 0 

rtq — if 'st = 0 then u else 's/ end; 

f ^ niin(rcg,S - X;;, SRESi - /im6o,max(SLiMr> - sreso,0)); 

SRES/) — SRES£) + ^ 

{where 'ste [0...255]} 

These operations are used to reserve streams prior to creating streams with the STREAMXREATE 
operation. 

The number of streams reserved for the protection domain to which this stream belongs, SRES^, 
is incremented by req if possible. The result register t reflects the amount by which SRES/? was 
actually changed. The register u is an unsigned integer. 

The resulting stream reservation will not exceed the larger of the current reservation and SLiMz), 
the maximum number of streams allocated to this protection domain. In addition, the sum of all 
reser^^ations cannot exceed S, the number of streams available in the processor. The operating 
system may encourag€(inhibit) parallelism by setting SLiMf) above(below) SRES^?. 

The .TEST versions of these operations never generate overflow/ NaN, and generate carry if the 
reser\-ation SKZSo was changed by exactly req. 

RAISES 

(nothing) 
SEE ALSO 

STREAM-CREATE, STREAM.QUIT 



. t u 0 st . 

47 42 37 35 27 21 



... t U 0 St OS . 
64 47 42 37 3i 27 2: 



STREAM-RESERVE. 
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(TARGET JDISP tn offset) ... m 2 offset OA ... A 

S4 4: 44 42 2: 2) 0 

in — ssw.pc + 'offset 

{where offset £ [-2^^ ... 2^^ - 1]} 

(TARGET JNDEX u) ... in 3 u 000 OA A 

64 47 44 42 Zr 27 21 0 

in — ssw.pc + u 

(TARGET JLESTORE /n u) ... l u 000 OA A 

64 47 44 42 37 27 21* * ' 0 

tn ^ U 

(TARGET-SAVE X m) ... x 0 0 F06m C ' 

64 21 26 13 II 7 6 3 0 

X Strea7nSiatusWord{ssv^.cv,ssv:.tTn,ssv/.md,in) 

These operations establish target values for the program counter to be used by a subsequent branch 
operation, i.e. one of the operations from the JUMP, family, and LEVEL JITN. 

There are eight target registers addressed by tn. The TARGET_DISP and TARGETJNDEX 
operations set the addressed target, using the values currently in the ssw.pc. TARGET-SAVE 
saves the StreamStatusWord with the pc replaced by the addressed target in register x. 

Conversely, TARGETJIESTORE restores the addressed target from register u. 

K field '^priv.tO'' is set in the program state descriptor, setting target zero is supervisor-privileged; 
This allows trap-handlers to be trustworthy. See §8.5. 

When a target register is loaded, the corresponding program instructions are prefetched; see §7.2. 
Separating the TARGET from the JUMP allows the instruction fetch latency to be hidden. 



RAISES 

privileged 
COUNTS AS 

CNT-TARGET if not TARGET.SAVE 
SEE ALSO 

JUMP^ LEVELJITN, SSWJDISP 

Level Manipulations Operations 



TARGET. 
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jcessor Pnnciples of Operation 



(TRAPJIESTORE tr y) 

(trap register at 'tr) — y 
{where 'tr € [0 ... 7]} 

(TRAP_SAVE X tr) 

X — (trap register at 'tr) 
{where 'tr 6 [0 . . . 7j} 



• 00 y tr22 



... I tr 03 00 

*i 21 1$ 11 C 0 



^^Tk T rT '° ^^^P By convention, these operations are 

used by the trap handler. The TRAP JIESTORE operation does not check for poison on y m 
exception aUows the trap handler to free a register without raising a spurious pohon exception 
Trap registers are allocated dynamically; see §9.2. 



RAISES 
(nothing) 



TRAP. 
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(UNSXEIL t u) ... t u 18 OB 08 ... A 

64 4: 42 3r Z2 27 2J 0 

t — unsigned integer ceiling of float u 
(UNS-CEIL.TEST t u) ... ? u 18 OB 09 A 

64 47 42 37 32 27 21 0 

t — unsigned integer ceiling of float u 
(UNS.CHOP r u) t u 18 09 08 ... A 

64 4: 42 37 32 27 21 0 

t — unsigned integer chop of float u 
(UNS-CHOP-TEST i u) ... f u 18 09 09 ... A 

64 47 42 37 32 27 23 0 

t — unsigned integer chop of float u 
(UNSJLOORfu) ... t u 18 OA 08 ... A 

64 47 42 37 32 27 21 0 

if — unsigned integer floor of float u 
(UNS J^LOOR.TEST ? u) ... f u 18 OA 09 ... A 

64 47 42 i7 32 27 22 0 

t — unsigned integer floor of float u 
{VNSJNEAR t u) ... f ti 18 08 08 ... A 

64 47 42 37 32 2T 21 0 

t ^ unsigned integer nearest float u 
(UNS-NEAR.TEST t u) ... f u 18 08 09 ... A 

64 47 42 37 33 27 21 0 

t unsigned integer nearest float u 
(UNS -ROUND f u) ... ^ u 18 OE 08 ... A 

64 47 43 37 32 27 21 0 

i ^ unsigned integer round of float u 
(UNS-ROUND.TEST f ti) ... f u 18 OE 09 ... A 

64 47 42 37 32 27 21 0 

t unsigned integer round of float u 

These operations convert floating-point numbers into unsigned integers. The roundings axe directed 
as in IEEE Standard 754. UNS -ROUND uses the rounding mode in the ssw. 

A floatJn\'aIid exception is raised when the result is negative or too large to represent. In these 
cases the result is reduced modulo 2^. 

The ..TEST versions of these operations never generate cany or overflow/NaN. 



RAISES 

float invalid. floatJnexact 
SEE ALSO 

FLOATJNT, FLOAT.UNS, INTXEIL, INT.CHOP, INT^LOOR, LNTJ^EAR, INTJLOUND, 
FLOAT.CEIL, FLOAT.CHOP, FLOAT JLOOR, FLOAT-NEAR, FLOATJLOUND 



Unsiened Ooerations 



TTKTC 
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,essor Principles of Operation 



(UNS^DD.CARRY.TEST x y z) x v z2i C 

^ y + r + CV; ,carTy^ integer 

(UNS^UB.CARRY.TEST z y z) . . . z y . 23 C 

X — y + -"2 + CV/ .carry, integer 

These operations are intended to be used in multi-word integer add. subtract, and m.Utiply. Note 
tHat carry-m is taken from cvj, to simplify use of these operations in loops. 



RAISES 

(nothjiig) 
SEE ALSO 

INT-A.DD, INT^UB 

UNS_\DD-CARRY. 
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(UNS_A.DD J^UL.UPPER t u v u-) . . . t u v u- 2C . . . A 

t — (u -r V* unsigned 

(UNS^DDJvlUL.UPPER.TEST t u v w) ... t u v w 2D A 

64 47 42 ;r 32 27 21 0 

t ^ [u V » w)/2^ . unsigned 

The UNS-.A.DD-MUL-UPPER operation is used to implement multiple-precision integer multipli- 
cation. 

The operation produces the high 64 bits of -f v * w. When u, t\ and w contain 52- bit unsigned 
integers, the only possible effect of u on this result is via a carry from its position in the low 52 bits. 
The low bits of the unsigned add-multiply may be obtained from an IXTJIDD J^IUL operation, 
even though its operands are ordinarily interpreted as signed two's-complement values. 

There is no UPPER^UB-MUL because multi-word subtract-multiply does not use it. 

The -TEST version of this operation never generates overflow/NaN or carry. 

If r or XT is outside [-2^^ ... 2^^ - 1], the float .extension exception is raised. 



RAISES 

float-extension 
SEE ALSO 

INT^DJ^UL, INT^UBJ^UL, INT^UB-MUL JLEV 

Unsigned Operations UNS.ADD3'IUL.UPPRT? 



.-ssor Principles of Operation 



(UNSJDIV < « V u;) ... t n V w IC ... A 

64 4T 42 3: 32 27 21 0 

exp — unbiased exponent of w 
temp — V * round to floor 

t ^ temp ♦ 2*^^^, round to floor 

(UNSJDIV.TEST t u v w) ... t u v w Id ... A 

64 47 43 i7 Z2 27 21 0 

exp — unbiased exponent of w 
temp — p ♦ round to floor 

t ^ temp ♦ 2*^, round to floor 

These operations are used to implement integer division. The product from unsigned v and Spe- 
cialFloat64 w is shifted right according to exp and rounded, producing an unsigned integer. 

The .TEST version of this operation generates carry when the quotient is not exact, i.e. when the 
division by 2'^^^ yields a non-zero remainder. This operation never generates overflow. 

If V is outside [0...2^^ - 1], the float jextension exception is raised and the result in t may be 
incorrect. 

-\lthough register u is not used in the current hardware implementation, the software requires u to 
contain the denominator in order to properly handle float .extension exceptions. 



RAISES 

float jextension 
SEE ALSO 

INTJDIVXHOP, INTJDIV JLOOR, §12.6 



UNSJ)IV. 
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(UNS J.OADB r s) ... r 5 7... M 

64 61 i6 il 4: 0 

T zero €xtend(byte at 5). with FE-NORMAL 
(UNS-LOADB-AC_DISP r s ac disp) ... r 5 D ... ac disp3 MC 

64 61 U i\ 4: 21 16 2 0 

r ^ zero extend(byte at 5 + moif 2^^), with 'ac 
{where 'disp € [0 . . . 16383]} 

(UNS_LOADB^C_INDEX r 5 ac y) ... r 5 F ... ac t/ OE 39 MC 

64 61 56 &1 47 21 16'='l} 6 0 

r — zero €xtend(byte at 5 -h y mo(i 2^®), with 'ac 
(UNSJLOADBJDISP r s disp) ... r 5 D ... disp 2 MC 

64 61 66 51 47 21 ^^7 0 

r *- zero extend(byte at 5 4- 'disp mod 2^®), with FEJS'ORMAL 
{where 'disp G [0 . . . 524287]} 

(UNS JLOADB JNDEX r 5 y) . .. r 5 F ... 00 y OE 38 MC 

64 61 56 &1 47 21 16 11 6 0 

r — zero extend(byte at 5 + y mod 2^^), with FE-NORMAL 

These operations load an unsigned byte from memory. The fccontrol is taken from the ac field if 
present, or forced to FEJS'ORMAL. If ac is present, its forward, data trapO, and data trapl disable 
bits are used; otherwise those of s are used. 



RAISES 

dataJiwjerror, data.prot, data.^gnment, data-blocked 
COUNTS AS 

CNTJ.OAD 
SEE ALSO 

STOREB, INTJLOADB 
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(UNS J.OADH r 5) ... r s S ... M 

(4 «1 it il 47 0 

r — zero extend(halfword at s), with FE_NORMAL 
(UNS J,OADH_ACJDISP r s ac disp) ... r s d ... ac sdisp 9 MC 

64 61 H il 47 21 16 ''^4 0 

r - zero extend(halfword at 5 + 'disp mod 2*^), with 'ac 
{where 'disp € [0 . . . 16383], 'sdisp = 'disp/4} 

(UNS JLOADH_AC_INDEX r s ac y) ... r s ¥ ... ac y OA 39 MC 

64 61 66 61 47 21 16^11 6 0 

r — zero €Xtend(halfword at 5 + 4 * y mod 2^^), with 'ac 
(VNSJLOABKJDISP r s disp) ... r s D ... sdisp 8 MC 

64 61 46 SI 47 21 0 

r - zero extend(halfword at 5 + '<f:sp mod 2*^), with FE_NORMAL 
{where 'disp 6 [0 . . . 524287], 's(ft5p = 'disp/4} 

(UNS_LOADH_[NDEX r s y) ... r s F ... 00 « OA 38 MC 

64 61 46 41 47 21 16*11 6 0 

r - zero extend(halfword ats + A^y mod 2*^), with FE_NORMAL 

These operations load an unsigned halfword from memory. The fej:ontrol is taken from the ac 
field if present, or forced to FE_\ORMAL. If ac is present, its forward, data trapO, and data trapl 
disable bits are used: otherwise those of s are used. 



RAISES 

dataJiw^rror, data_prot, data-alignment, data_bloci£ed 
COUNTS AS 

CNT-LO.A.D 
SEE ALSO 

STOREH. INT-LOADH 

UNSJ-OADH. 
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(UNS_LOADQ r s) ... r s 6 ... M 

r — zero extend(quarterword at s), with FE3'0RMAL 
{VT^SJLOADQJiCJDISP r s Qc dispj ... r 5 D ... ac sdisp S MC 

64 CI U M 47 21 16 3 0 

r ^ zero extend(quartenvord at 3 + 'disp mod 2"*®), with 'ac 
{where 'disp € [0 . . . 16383], 'idi^p = 'di$p/2} 

(UNS JLOADQ-AC JNDEX r s ac y) ... r 5 F ... ac v OC 39 MC 

64 61 56 51 47 21 IC^ll 6 0 

r — zero €Xteiid(quarterword at 5 + 2 ♦ y mod 2"*^), with 'ac 
(UNS J.OADQ J3ISP r 5 £/:5;?) ... rsD ... 5(ii5p4 MC 

64 61 56 51 47 21 ^^3 0 

r zero ext€nd(quarterword at 5 + mocf 2"^^), with FE3-0RMAL 
{where 'disp € [0 . . . 524287], 'sdisp = 'disp/2} 

(UNS J.OADQ JNDEX r 5 y) . . . r s F . . . 00 y OC 38 MC 

64 61 56 51 47 21 16 11 6 0 

r — zero ext€nd(quarterword at 5 + 2 * y mod 2^*), with FE-NORMAL 

These operations load an unsigned quarterword from memory. When the destination register r 
is rO with UNSJLOADQ, no operation is performed. The fe-control is taken from the ac field if 
present, or forced to FEJ^'ORMAL. If ac is present, its forward, data trapO, and data trapl disable 
bits are used: otherwise those of s are used. 



RAISES 

dataJiwjerror, data.prot, data-alignment, dataJblocked 
COUNTS AS 

CNTJ.OAD 
SEE ALSO 

STOREQ, INT-LOADQ 
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(UNS JIECIP^HIFT zy) . . . r y OF 00 C 

X — log2y, round to ceiling 

(UNS JlECIP^HIFT.TEST x y) . . . x y OF 01 C 

X — log2y, round to ceiling 

These operations are used to compute integer reciprocals. They compute the ceiUng of the log base 
2 of y. When y is zero, x is set to -1. 



RAISES 

(nothing) 
SEE ALSO 

UNSJ[)IV, §12.6 

UNSJIECIP^HIFT. 



Operaiion Descriptions 

(UNS.SHIFT_RIGHT x y z) ^. --..^J.r.^^ C 

I - y»2 

(UNS.SHIFT_RIGHT_TEST i y r) • - ^.^.e^ji-e-^ ^ 

1 - y>z 

These operations shift to the right, filling in O's on the left. Unsigned shift counts in : are taken 
modulo 64. 

The -TEST version generates carry if a 1-bit is shifted out of w or y, and never generates over- 
flow/XaX. 



RAISES 

(nothing) 
SEE ALSO 

INT^HIFTJUGHT, SHIFTJ.EFT, SHIFT J»AIILEIGHT 

Unsigned Ooerations ' UNS^HIFTJUGHT. 
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Chapter 12: Programming Examples 



12.1 Stream Creation 



[[This needs examples.]} 



12.2 Forwarding Pointers 



[[This needs examples.]] 



12.3 Vector Loops 

Consider this Fortran loop to compute the inner product: 

<fortran inner product>^ 

do 100 i = L n 

sura = sum + a(i)*b(i) 
100 continue 
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It could be compiled to the TERA program fragment shown below (omitting most of the preamble, 
and with the loop unrolled): 



<t€ra inner product> = 

(allocate-from-reg Reg 1) 
(assign -reg Reg 

ntrips sum sumoffset 

stackpointer 

ai apointer zistep ^ 
bi bpointer bstep 

) 

(equ ZERO rO) 

(inst 1 (INT.LOAD ai apointer) 

(TARGET.DISP tl looplOO) lo 
(REG.MOVE sum ZERO)) 

(label loop 100) 

(inst 0 (INT.LOAD bi bpointer) 

(INT_ADD apointer apointer astep) 

(INT.ADD bpointer bpointer bstep)) »» 
(inst 1 (INT.LOAD ai apointer) 

(FLOAT.ADD.MUL sum sum ai bi) 

(INT_ADD apointer apointer astep)) 
(inst 0 (INT.LOAD bi bpointer) 

(INT.SUB.IMM.TEST ntrips ntrips 2) „ 

(INT.ADD bpointer bpointer bstep)) 
(inst 1 (INT.LOAD ai apointer) 

(FLOAT.ADD.MUL sum sum ai bi) 

(JUMP if_Ue cO tl)) 
(inst 0 (STORE_i:'DEX sum stackpointer sumoffset)) m 

The TARGET in the first instruction sets the target's program counter. The lookahead values let 
pairs of instructions in the loop run in parallel. The loop achieves one flop per instruction. 
Another Fortran loop of interest is the following: 



<fortran inner prodiict>= 

do 200 i = IjD 

do 100 j = 1^1 

c(i) = c(i) + a(ij)*b(j) 
100 continue 
200 continue 
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Loop transformations (strip mining and interchange) yield 

<fortran inner product>=i 

do 201 ii = hm - 19,20 
lasti = ii -i- 19 
do 101 j = Iji 

do 1 i = iijasti 

c(i) = c(i) + a(ij)*b(j) 
1 continue 

101 continue 

201 continue 

do 202 i = lasti + l^n 
do 102 j = l4i 

c(i) = c(i) + a(ij)*b(j) 

102 continue 

202 continue 



The do 1 loop may now be unrolled: 



<unrolltd jortmn inner product>^ 

do 201 ii = l.m - 193 

lasti = ii + 19 

do 101 j = l,n 

c(ii) = c(ii) + a(ii, j)*b(j) 

c(II + 1) = c(n + 1) + a(II + lj)*b(j) 

c(II + 2) = c(II + 2) + a(II + 2j)*b(j) 

c(II + 19) = c(n + 19) + a(II + 19J)*b(j) 

101 continue 

201 continue 

do 202 i = lasti + 1^ 
do 102 j = 1^ 

c(i) = c(i) + a(ij)*b(j) 

102 continue 

202 continue 



10 



10 



2ft 
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The do 101 loop now corresponds to ihe TERA assembly language fragment below (again, oniitting 
the preamble): 

<terQ unrolled inner product> = 

(allocate-from-reg Reg 1) 
(assign-reg Reg 

bj bij bpointer bstep 

aij apointer astep abigstep 

c.O c.l C.17 C.18 cJ9 5 
) 

) • • • 

(inst 0 (TARGET.DISP tl looplOl)) 
(label looplOl) 

(inst 0 (INT.LOAD bj bpointer) lo 

(INT.ADD apointer apointer abigstep) 

(INT_ADD bpointer bpointer bstep)) 
(inst 0 (INT.LOAD aij apointer) 

(INT.ADD apointer apointer astep)) 
(inst 0 (INT.LOAD aij apointer) u 

(FLOAT.ADD.MUL c.O c.O aij bj) 

(INT.ADD apointer apointer astep)) 
(inst 0 (INT.LOAD aij apointer) 

(FLOAT.ADD.MUL c.l c.l aij bj) 

(INT.ADD apointer apointer astep)) 20 

(inst 0 (INT.LOAD aij apointer) 

(FLOAT.ADD.MUL c_I7 c.l7 aij bj) 

(INT.ADD apointer apointer astep)) 
(inst 0 (INT.LOAD aij apointer) 35 

(FLOAT.ADD.MUL c.l8 c.l8 aij bj) 

(INT.SUB_IMM.TEST ntrips ntrips 1)) 
(inst 7 (FLOAT.ADD.MUL c.l9 c.l9 aij bj) 

(JUMP if.ile cO tl)) 
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The lookahead values of 0 throughout most of the loop could be increased bv using distinct aii 
and apointer registers for each value of but this consumes more registers aid thereby decreases 
the unrolbng factor. 

Minimal lookahead and an unrolling factor of 20 yields 40 flops everv 22 instructions or about 1.9 
flop/instruction (This figure could be improved to about 80 flops every 42 instructions bv unrolling 
two uerations of the j loop.) On the other hand, a sustained lookahead of 7, the maximum possible 
can be had by unrolling 8 iterations of the i loop and 2 of the j loop. This approach still vields 
a respectable 32 flops everj- 18 instructions, or about 1.8 flop/instruction, with eight times fewer 
streams needed. Once again, the somewhat lengthy preamble is omitted; in this case, it involves 
loading not just 8 ci s but 8 aij's as well. The loop uses 23 registers, so it is quite reasonable to 
expect a compiler to unroll as fully as this: 



<tera fully unrolled inner product>= 

(allocate- &om-reg Reg 1) 
(assign -reg Reg 
ntiips 

aOj alj a7j apointer ajstep 
bjO bjl bpointer bstep cO cl c7) 
(define AISTEP (double 'SIZE)) 

(inst 0 (TARGET DISP tl loop)) 
(label loop) 

(inst 7 (INT.LOAD aOj apointer) 

(FLOAT.ADD.MUL cO cO aOj bjO) 
(INT.ADD bpointer bpointer bstep)) 

(inst 7 (INT.LOAD.DISP alj apointer AISTEP) 
(FLOAT.ADD.MUL cl cl alj bjO)) 

(inst 7 (INT.LOAD.DISP a7j apointer (» 7 AISTEP)) 
(FLOAT.ADD.MUL c7 c7 a7j bjO)) 

(inst 7 (INT.LOAD bjO bpointer) 

(INT.ADD apointer apointer ajstep) 
(INT.ADD bpointer bpointer bstep)) 

(inst 7 (INT.LOAD aOj apointer) 

(FLOAT.ADD.MUL cO cO aOj bjl) 
(INT.SUB.IMM.TEST ntrips ntrips 2)) 

(inst 7 (INT.LOAD.DISP alj apointer AISTEP) 
(FLOAT.ADD.MUL cl cl alj bjl)) 

(inst 7 (INT.LOAD.DISP a7j apointer (' 7 AISTEP)) 
(FLOAT.ADD.MUL c7 c7 a7j bjl)) 

(inst 7 (INT.LOAD bjl bpointer) 

(INT.ADD apointer apointer ajstep) 
(JUMP if ile cO tl)) 
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12.4 Doubled Precision Floating-point Arithmetic 



The TERA architecture provides support for r28-bit "doubled precision" floating-point arithmetic. 
A doubled precision representation is an ordered pair [X,x] of floating-point numbers in which x 
is insignificant compared to A', that is. round(A' + i) = X. The only rounding mode supported in 
doubled precision arithmetic is "round to nearest". 

We say [X,x] is the doubled precision representation of the real number ^ if A' = round(f) and 
X = round(^ - round(O)- It follows from the definition that round(A' + x) = X. Testing the more 
significant part of a doubled precision value is sufficient because of this "normalization" property. 
Another important property is this: if A and B are floating-point numbers, then there is a unique, 
exact doubled precision representation for A + B. 
The representation [X, i] of ^4 + 5 is computed as follows: 

< doubled precision math> = 

(define (doubled.add.single X x A B tempi) 
(inst 0 (FLOAT.MMAX tempi A B) (FLOAT.ADD X A B)) 
(inst 0 (FLOAT.MMIN tempi A B) (FLOAT.SUB x tempi X)) 
(inst 0 (FLOAT.ADD x x tempi)) 
) 



A doubled precision floating-point add. [Z.z] = [.Y.i] + [Y,y], is computed like this, where taaq)l, 
tenp2, tenp3, and teiiip4 are temporary registers that are distinct from those holding Z or z: 

< doubled precision math> = 

(define (doubled_add Z z X x Y y tempi temp2 temp3 temp4) 
(let* ((A tempi) (a temp2) (B temp3) (b temp4) 
(C B) (c a) (t z) (u Z) 

) 

(inst 0 (FLOAT.MMAX t X Y) (FLOAT.ADD A X Y)) » 

(inst 0 (FLOAT.MMLN t X Y) (FLOAT.SUB a t A)) 

(inst 0 (FLOAT.ADD a a t) (FLOAT.ADD B x y)) 

(inst 0 (FLOAT.MMAX t x y) (FLOAT.ADD a a B)) 

(inst 0 (FLOAT.SUB b t B) (FLOAT.ADD C A a)) 

(inst 0 (FLOAT.MMIN u x y) (FLOAT.SUB c A C)) ,o 

(inst 0 (FLOAT.ADD c c a) (FLOAT.ADD b b u)) 

(inst 0 (FLOAT.ADD c c b)) 

(inst 0 (FLOAT.ADD Z C c)) 

(inst 0 (FLOAT.SUB z C Z)) 

(inst 0 (FLOAT.ADD z z c)) „ 
) 

) 
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Doubled precision floating-point subtract is similar, with the y and Y operands explicitly negated. 

Kdovbled prtcision math>= 

(define (doubledjub Z 2 X x Y y tempi temp2 temp3 temp4) 
(let* ((A tempi) (a tcmp2) (B temp3) (b temp4) 
(C B) (c a) (t 2) (u Z) 
) 

(inst 0 (BIT.MASK a 63 63) (FLOAT.SUB A X Y)) * 

(inst 0 (BIT.XOR a Y a) (BIT.XOR u y a)) 

(inst 0 (FLOAT.MMAX t X a) (FLOAT.SUB B x y)) 

(inst 0 (FLOAT.MMIN t X a) (FLOAT.SUB a t A)) 

(inst 0 (FLOAT.MMAX t x u) (FLOAT.ADD a a t)) 

(inst 0 (FLOAT.SUB b t B) (FLOAT.ADD a a B)) lo 

(inst 0 (FLOAT.ADD C A a)) 

(inst 0 (FLOAT.MMIN u x u) (FLOAT.SUB c A C)) 

(inst 0 (FLOAT.ADD c c a) (FLOAT.ADD b b u)) 

(inst 0 (FLOAT.ADD c c b)) 

(inst 0 (FLOAT.ADD Z C c)) » 
(inst 0 (FLOAT.SUB z C Z)) 
(inst 0 (FLOAT ADD 2 z c)) 
) 
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The doubled precision product [Z.z] = [X.x] x [Y.y] is computed as follows, where tempi and 
teiap2 must be temporary registers distinct from those holding Z, z. X. x. Y. or y. This code relies 
on the fact that the floating-point mulliply-add operations only round once. 

<doubled precision math> = 

(define (doubled.mul Z z X x Y y tempi temp2) 
(let' ((a tempi) (b temp2) 

(c tempi) (d temp2) (e z) (f temp2) 

(ZERO rO) 

) 

(inst 0 (FLOAT_MUL_LOWER a ZERO X y)) 
(inst 0 (FLOAT.MUL.LOWER b ZERO x Y)) 
(inst 0 (FLOAT.ADD.MUL c ZERO X Y) 

(FLOAT.ADD dab)) 
(inst 0 (FLOAT.MUL.LOWER e c X Y)) ,o ' 

(inst 0 (FLOAT.ADD f d e)) 

(inst 0 (FLOAT.MMAX z c f) (FLOAT.ADD Z c f)) 
(inst 0 (FLOAT.MMLX c c f) (FLOAT.SUB z z Z)) 
(inst 0 (FLOAT.ADD z z c)) 

) 

) 



An important special case occurs when products of working precision numbers X and Y are to be 
computed in doubled precision [Z, z]: 

<doubled precision math> = 

(define (doubled.single.mul Z z X Y) 
(let ((ZERO rO)) 

(inst 0 (FLOAT.ADD.MUL Z ZERO X Y)) 
(inst 0 (FLOAT MUL LOWER z Z X Y)) 
) 

) 
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12.5 Floating-point Division and Square Root 



While directly implementing floating-point division was deemed infeasible, the TERA architecture 
provides support for correctly rounded IEEE division. Starting with a reciprocal approximation, 
the adder-muJtipIier is used to compute a correctly rounded quotient. 

The floating-point divide g = j/j/ is computed like this, where q, x and y are held in registers and 
tempi is an additional temporary register which must be distinct from the first three. 

< floating divide>= 

(define (fioat.divide q x y tempi temp2) 
(let ((r tempi) (e temp2)) 
(float.reciprocal r y q) 
(inst 0 (FLOAT_DIV_APPROX q y x r)) 

(inst 0 (FLOAT.DIV.ERROR e x y q)) * 
(inst 0 (FLOAT DIV q q e r)) 

) 

) 



< floating divide> = 

(define (float.reciprocal r y tempi) 
(let ((e tempi)) 

(inst 0 (FLOAT.RECIP.APPROX r y)) ; traps if y is denorm 
(inst 0 (FLOAT.RECIP.ERROR e y r)) 

(inst 0 (FLOAT.ITER r r e r)) 

(inst 0 (FLOAT.RECIP.ERROR e y r)) 

(inst 0 (FLOAT ITER r r e r)) 

) 

) 

(define (float .reciprocaI_denonn r y) 
(let ((tempi r)) 

(inst 0 (INT.IMM tempi 1074)) 

(inst 0 (FLOAT.SCALB tempi y lempi)) 

(inst 0 (INT.RECIP.APPROX r tempi)) 

) 

) 
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The first FLOATJlECIP-\PPROX gives a reciprocal thai is accurate to 14 bits. The FLOAT.- 
RECIP-ERROR and FLOAT JTER pairs form a Newton iteration, which raises the accuracy to 28 
and then 54 bits. The product of the reciprocal r and the numerator x is a quotient q correct to 1 
ulp. The final instructions compute a remainder to correctly round q and deliver the final quotient. 
When the divisor y is denormalized, the trap handler should execute the floatjeciprocal.denorm 
code sequence to compute the correct initial reciprocal approximation. 

Computing the square root is performed similarly. The square root q = ,/y is computed like this, 
where q and y are held in registers, and tempi is an additional temporary register which must be 
distinct from the first two. 



< floating square rooi> = 

(define (float ^sqrt q y tempi temp2) 
(let ((r tempi) (e temp2) 
(.float ^sqrt q y r e) 

(inst 0 (FLOAT.SQRT.ERROR.TEST e y q q)) 

(inst 0 (FLOAT.SQRT q q e r)) s 
) 

) 

) 

(define (_float_sqrt q y r e) 

(inst 0 (FLOAT.RSQRT_A?PROX r y)) ; traps if y is denorm lo 

(inst 0 (FLOAT.SQRT.APPROX.TEST q y y r)) 

(inst 0 (FLOAT.RSQRT_ERROR.TEST e y q r)) 

(inst 0 (FLOAT.ITER r r e r)) 

(inst 0 (FLOAT.SQRT.APPROX.TEST q y y r)) 

(inst 0 (FLOAT.RSQRT.ERROR.TEST e y q r)) » 

(inst 0 (FLOAT.ITER r r e r)) 

(inst 0 (FLOAT.SQRT.APPROX.TEST q y y r)) 

) 

(define (float .rsqrt.denorm r y) 
(let ((tempi r)) , » 

(inst 0 (EvT.IMM tempi 1022)) 
(inst 0 (FLOAT.SCALB tempi y tempi)) 
(inst 0 (INT.RSQRT.APPROX r tempi)) 

) 

) 



Here, the initial approximation to the reciprocal square root is correct to at least 14 bits. The 
FLOAT-SQRT-APPROX.TEST uses the reciprocal root to compute an estimate of the square 
root. The two iterations improve the accuracy to the necessary 54 bits. The final operation 
computes the correct rounding for the delivered result. When the argument y is denonnalized, the 
trap handler should execute the float_rsqrt_denorm code sequence to compute the correct initial 
reciprocal square root approximation. 
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The TERA architecture provides support for 64-bit integer division, including both signed aad un- 
signed integer data types. The current implementation supports 53-bit integer division in hardware 
and longer operands in software. For signed division, we provide instructions to allow the quotient 
to be rounded toward zero by chopping, as in FORTRAN, or rounded toward negative infinity, so 
that9=LfJ- ^ . 6 

The FORTRAN integer divide q = x/y is computed like this, where q, x and y are held in registers, 
and tanq>l is an additional temporary register, which must be distinct from the first three. 



< integer divide> = 

(define (int.reciprocal r y tempi temp2) 
(let ((fy temp2) 
(e tempi) 

) 

(inst 0 (FLOAT.INT fy y)) 
(inst 0 (INT.RECIP.APPROX r fy)) 
(inst 0 (INT.RECIP.ERROR e fy r)) 
(inst 0 (FLOAT.ITER r r e r)) 
(inst 0 (INT.RECIP.ERROR e fy r)) 
(inst 0 (FLOAT.ITER r r e r)) 
(inst 0 (INT.DIV.CHOP e y y r) 

(INT.ADD.IMM r r 1)) 
(inst 0 (INT.SUB r r e)) 
) 
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The first reciprocal appro>amation is correct to 14 bits. Two iterations raise that accuracy to 54 
bits. The fix step is needed to guarantee that r is correctly rounded to the ceiling of the reciprocal. 
Finally, the divide instruction multiplies by the reciprocal and shifts right to compute the correct 
quotient. 

<integer divid€> = 

(define (int^divide_chop q x y tempi temp2) 
(let ((r tempi) 

) 

(int.reciprocal r y q temp2) 

(inst 0 (INT.DIV.CHOP q y x r)) » 

) 

) 

(define (int_divide_chopjest q x y tempi temp2) 

(let ((r tempi) 

) • 

(int .reciprocal r y q temp2) 

(inst 0 (INT.DR-.CHOP.TEST q y x r)) 

) 

) 

Signed integer division with floored rounding is analogous, except the final INTJDIVXHOP is 
replaced with INT -DrV^JFLOOR so that the sign of the (implied) remainder agrees with the de- 
nominator rather than the numerator. 

<int€ger divid€> = 

(define (int.divide.floor q x y tempi temp2) 
(let ((r tempi) 

) 

(int_reciprocaI r y q temp2) 

(inst 0 (INT.DR'.FLOOR q y x r)) » 
) 

) 

(define (int.divide.floorjest q x y tempi temp2) 
(let ((r tempi) 
) 

(iiit_reciprocal r y q temp 2) 

(inst 0 (INT_DIV.FLOOR_TEST q y x r)) 

) 

) 



12.6 Integer Division 



192 



The unsigned quotient is computed siniilarly: 

Kunsigned divide>= 

(define (uns.reciprocal r y tempi temp2) 
(let ((fy temp2) 
(e tempi) 

) 

(inst 0 (FLOAT_UNS fy y)) 
(inst 0 (INT.RECIP.APPROX r fy)) 
(inst 0 (INT.RECIP.ERROR e fy r)) 
(inst 0 (FLOAT.ITER r r e r)) 
(inst 0 (INT.RECIP.ERROR e fy r)) 
(inst 0 (FLOAT.ITER r r e r)) 
(inst 0 (UNS.DFV e y y r) 

(INT.ADD.IMM r r 1)) 
(inst 0 (INT.SUB r r e)) 
) 

) 



Kunsigned divide>= 

(define (uns_divide q x y tempi) 
(let ((r tempi) 
) 

(uns.reciprocal r y q) 

(inst 0 (UNS DFV q y x r)) 

) 

) 

(define (uns.divide.test q x y tempi) 
(let ((r tempi) 

) 

(uns.reciprocal r y q) 

(inst 0 (UNS.DIV.TEST q y x r)) 

) 

) 
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Note that when the divisor is constant, the sequence simplifies down to one compute instruction (and 
one constant which must be materialized). For e.xample. the following two instructions compute 
q = 2/20. 

< integer divide by 20> = 

(define (int_divide_20 q x) 
(let* ((r q)) 

(const FIFTH (+ (- (quotient (+ (expt 2 56) 4) 5) (expt 2 53)) 

(• (- 510 3) (expt 2 53)))) 
(inst 0 (LOAD.DISP r LINKAGE.PTR FIFTH)) * 
(inst 0 (INT.DIV.CHOP q rO x r)) 

) 

) 
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Chapter 13: I/O Processor Introduction 

The I/O processor contains four instruction streams and a control word. The control word is used to 
assign a segment descriptor for fetching instructions and initializing the streams. The four streams 
have dedicated purposes: memory load, memory store, HIPPI input and HIPPI output. For each 
stream, instructions are fetched and executed in a linear fashion. If an exceptional event occurs 
during the execution of an instruction, the stream performs a link operation with the driver to 
inform it of the exception. A link operation is defined as a stream status word production, followed 
by a program counter consumption. The driver program must use consumer/producer semantics 
on the link address and the new control word address, respectively. The I/O processor ignores the 
forward, data trap, and memory full bits when fetching instructions. 

o The memory /oad stream loads from data or I/O memory into the outbound data buffer. 

o The memory store stream stores from the inbound data buffer into data or I/O memory. 

o The HIPPI output stream transfers bursts of data from the outbound data buffer out through 
the HIPPI interface. 

o The HIPPI input stream transfers bursts of received data into the inbound data buffer. 

The HIPPI interface section of the lOP is designed to conform with the physical layer specification 
of the ANSI X3T9.3 committee. The basic HIPPI clock rate is 25 Mhz, and the timeout clock 
period is 1 microsecond. The longest timeout period is 16.8 seconds. The HIPPI out section of the 
lOP has an external, fixed 50 Mhz clock which is used to generate the 25 Mhz timing and the lOP 
timeout clock. The HIPPI in section synchronizes to the source clock for its connection, yet uses 
the fixed timeout clock generated by the HIPPI out section. 

Each half of the lOP can operate as either a 32- or 64-bit HIPPI channel. The channel width can be 
selected when making each connection. The load and store streams operate only on 64-bit words. 

On power up, the lOP must not request or accept any connections until it has been initialized. To 
satisfy this requirement, some (external) power on reset circuit is needed. In addition, connections 
must be inhibited during scan. 

13.1 Link Status Word 

Each stream in the I/O processor produces a link status word when it links. That word generally 
indicates the reason for linking, and the program counter at which the link occurred. The program 
counter in both link words is a byte offset into the instruction segment. The lower three bits are 
ignored, since the lOP only performs full word, aligned, instruction fetches. The field ""exception"^ 
is set whenever one of the exceptions in bits 53 to 39 is set. The field "status Jink'' is set whenever 
the field ""pc" is not a program counter, but instead is some data such as an Ifield or error offset. 



10? Status Word 
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Valid Bits Wd Field Name 



Type 



Description 
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lOPStatusWord: Exceptions 



LSOI 


63 1 


exception 


Flag 


Exception 


LSI 


62 1 


cratiic linL" 

9 vat Ud,.iJil A 


r lag 


Status result 




61-56 6 


nc fiplH 


T'nc 


Reserved for 0/S use; 10 F writes 0 










reserved 


LSOI 


53 1 


forcedJLink 


Flac 




LSOI 


52 1 


link-error 


Flag 


Link Error 


LbUl 


ol 1 


pJanut.error 


Flag 


Instruction Segment Limit Error 


LSOI 


. 50 1 


unimplemented.- 
address 


Flag 


Instruction Address unimplemented 
on resource 






p.uncorreciaDie.- 
error 


r lag 


Uncorrectable Instruction Error 


LSOI 


48 1 


illegal jDp -code 


Flag 


Illegal OP code 


S 


47 1 


packet.end 


Flag 


Packet End exception 


LSI 


46 1 


limit. error 


Flag 


Segment Limit Error 


LSI 


45 1 


unimplemented-- 
address 


Flag 


Address unimplemented on resource 


LSOI 


44 1 


uncorrectable.- 
error 


Flag 


Uncorrectable Data Error 


01 


43 1 


connectJost 


Hag 


Connect lost 


01 


42 1 


interconnectJost 


Flag 


Interconnection lost 


01 


41 1 


bad-connect 


Flag 


Unable to request connection 


01 


40 1 


timeout 


Hag 


Time out 


0 


39 1 


nojconnection 


Flag 


No connect before OUTJ^ACKET 




38-35 4 


0 




reserved 


*StatusWord: Status 








LSOI 


34-33 2 


StreamJdentity. 


lopStream 


Stream Identit}' 


LSOI 


32 1 


loopback 


Hag 


Loopback 



lOPStatusWoTxi: PC 

LSOI 31-0 32 pc Uns Program Counter or Ifield 

The field '^StreamJdentity" is encoded using the following enumeration. 
Name Value Meaning 

lopStream 

lOP J.OAD 0 Load stream 

lOP.OUT 1 Out stream 

lOP^TORZ 2 Store stream 

lOPJN 3 In stream 

The link status words axe arranged at word indices 0 through 7 in the lOP instruction segment. 
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Each stream has a pair, status and next pc. These are laid out in the following order: load^tatus, 
loadjiext.pc, outjstatus. out_next.pc, store^tatus, storejaext.pc, in-staius, in^ext«pc. 



I/O Operation Descriptions 
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Chapter 14: I/O Operation Descriptions 

I/O Processor programs have the same syntactic form as Lisp expressions. The CPU instructions are 
composed of several operations, 10? instructions always contain exactly one operation. Therefore, 
the INST wrapper is not needed in lOP assembly programs. 



Initialization Operations 
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(INST_SEGMENT disLen m.type unit Hmit base) 

, 00 0 tfzsLen m.type 0 unit 0 limit 0 base I 

InstSegment — immediate data 

The segment word holds the data address translation descriptor for lOP instructions. The format 
IS the same as data map entries in the processor except that the load and store levels and locked 
bit are omitted. 

The I/O Processor is reset or initialized through the segment word. The segment word is located 
at word offset 0 for the logical unit number assigned to the lOP. Whenever the segment word is 
wntten to, aU four streams perform a link operation in the new segment. .A.s there is only one 
program segment for all four streams, the lOP should be in an idle or suspect state before changing 
the segment descriptor. X write of the segment word also resets the inbound and outbound data 
buffers. Table 1 shows the bit aUocation for the stream status returned by a link operation. 

The p Jimit.error. p.unimplemented-address, and p.uncorreciable-error exceptions during a link 
operation cause the stream to retry the link operation, possibly indefinitely. 

Instructions are provided so that one stream may cause another stream to link. Generally, streams 
forced to link wiU link between instructions. However, a stream will abort an indefinite wait to 
link, so that deadlocked streams may be interrupted and reset. 



lOPJlESET 
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(LOADJLINK) 

Link Load stream 



64 



10 00000000000000 



0 



I 



(LOADJLINK.OUT) 



14 00000000000000 



I 



0 



Link Out stream 

The LINK operation forces the memory load stream to perform a link operation with the device 
driver. The link operation pair for the memory load stream is located at word offset 0 and 1 of the 
program segment for lOP code. No exception is raised. 

The LOADJLINK.OUT operation forces the out stream to perform a link operation. The out 
stream will link at the next opportunity, generaUy after completing the current instruction, but 
potentially by aborting an instruction in progress. The forced link bit will be set in the out stream's 
status word. 



R-^ISES 
(nothing) 



Memory Load Stream Operations 



LOADJ-INK- 
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(LOAD-SEGMENT disLen mjype unit limit base) 



LoadSegment — immediate data 



0 



This operation loads an address translation descriptor to be used by the memory load stream for 
tetchjng data. The format is the same as data map entries in the processor except that the load 
and store levels and locked bit are omitted. 

There are no exceptions for the LOADSEGMENT opcode. 



RAISES 
(nothing) 



LOAD-SEGMENT. 
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(loadj:rr.offset) 



12 000000000000000 



64 



Status. pc — Error offset 

This operation always links. If there are no masking exceptions, this operation stores the offset 
of the load request which first resulted in an uncorrectable.error. limit.error. or unimplemented.- 
address exception in the pc field of the link status word. The status Jink flag is set. If multiple errors 
are encountered, only the first is reported. Note that the offset returned may not be the lowest 
offset which resulted in an exception. If no errors were present. LOAD JERR^OFFSET returns the 
next offset that load stream will issue to the network. This operation is used for diagnosing the 
failure of a load instruction. 



RAISES 



statusJink 



Memor\- Load Stream Operations 



LOADJERIL 
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(LOADJLUSH) ^^13^^00000000000000^ I 

Flush outbound data buffer ' 

The flush operation clears all data in the outbound data buffer. This operation should be used 
only when the HIPPI out stream is not connected. If the flush operation is used while the HIPPI 
out stream is connected, the connection will be dropped. 



RAISES 
(nothing) 



LO.^DJLUSH. 
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(LOAD_ENDJ>ACKET) 



15 00000000000000 



64 



0 



Mark end of indeterminate length packet 

The end-packet operation indicates to the out stream that the contents of the buffer are the last 

words of an indeterminate length packet. This command should only be used with an OUT.- 
PACKET command of length 0. The load stream will wait until the outbound buffer is emptied 
or the OUT-PACKET command otherwise terminates before continuing with the execution of the 
next instruction. 



RAISES 
(nothing) 



Memory Load Stream Operations 



LOADJEND. 
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(LOAD J)ATA start.offset end.offstt) 2 start.offset en(Loffset I 

64 60 32 0 

Load data 

(LOAD JMAGE starLoffset end,offset) 3 start^offset end^offset I 

64 60 32 0 

Load image 

Load the 64-bit data and state beginning at the start^offset and continuing through all words up 
to end-offset minus one, inclusive. The low-order three bits of the start-offset and €nd_offset are 
ignored. Thus, byte addresses may be used without need for shifting. 

The load stream interprets the end_offset modulo 2^®. To include the last word in a segment, a 
IOOOOOOO16 or 0 may be used as the end-offset. 

All forwarding, data traps, and full/empty operations are disabled during the load operation. The 
load stream issues LOAD^TATE request to the memory resources, which respond with both the 
data and the access state stored at the given address. 

If JMAGE is used, the outgoing data buffer packs the control access fields for sixteen consecutive 
data words into a 64-bit state word and inserts the state word into the data stream after its 
respective data words. When JMAGE is used, the number of words to load from memory must be 
a multiple of 16. In addition, the number of non-packed data words loaded from memory before 
the JMAGE operation occurs must be a multiple of 16. Note that these operations may be used 
together to build a packet with a DATA header and IMAGE payload. 

This instruction need not abort due to a forced link unless there are no free buffers into which to 
load. 

This instruction will wait indefinitely for free space in the outbound buffer. Exceptions for the 
LOADJ)ATA/IMAGE operations are forced-link, uncorrectable-error, limit-error, or unimple- 
mented-address. 



RAISES 

forced-link, uncorrectable.error, limitjerror, unimplemented-address 



LOAD- 
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(STOREJ-INK) 

Link Store stream 



80 00000000000000 



0 



I 



(STORE J.INKJN) 



84 00000000000000 



I 



64 



0 



Link In stream 

The STORE JLINK operation forces the memory store stream to perform a link operation with the 
device driver. The link operation pair for the memory store stream is located at word oifset 4 and 
5 of the program segment for lOP code. No exception is raised. 

The STOREJLINKJN operation forces the in stream to perform a link operation. The current 
instruction of the in stream may be interrupted. The forced link bit will be set in the in stream's 
status word. 



RAISES 
(nothing) 



Memory Store Stream Operations 



STORE J.INK. 
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(STOREJSEGMENT disUen mjype unit limit base) 

81 0 dist^en mJype 0 unit 0 limit 0 base I 

StoreSegment — immediate data 

This operation loads an address translation descriptor to be used by the memory store stream for 
wnting data. The format is the same as data map entries in the processor except that the load 
and store levels and locked bit are omitted. 



RAISES 
(nothing) 



STORESEGMENT. 
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(STORE-REPLICATE starLoffset end^offset) 




9 starLojfset end^offset I 



Store fill data 

This operation is only valid while the lOP is in loopback mode. The LOAD stream should fetch 
the word that is to be replicated. The STORE stream duplicates the item and its access state to 
all the locations between start-offset and end^offset minus one. inclusive. The low-order three bits 
of the start-offset and end-offsei are ignored. Thus, byte addresses may be used without need for 
shifting. If multiple items are fetched by the LOAD stream, only the first item is removed from 
the outbound buffer. 

The store stream interprets the endjoff'set modulo 2^®. To include the last word in a segment, a 
lOOOOOOOie or 0 may be used as the end-offset. 

This instruction need not abort due to a forced link unless the inbound buffer is completely empty. 

This instruction will wait indefinitely for data to appear in the inbound buffer. Exceptions for the 
STOREJIEPLICATE operations are uncorrectable.error, limitjerror or unimplemented-address. 



RAISES 

uncorrectablejerror, limit.erTor, unimplemente<Laddress 



Memory Store Stream Operations 



STOREJIEPLICATE. 
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(STOREJERR_OFFSET) 83 00000000000000 I 

64 56 0 

Status.pc — Error offset 

This operation always links. If there are no masking exceptions, this operation stores the offset of 
the store request which first resulted in an uncorrectable.error, unimplemented^ddress, packet.- 
end, or limit-error exception in the pc field of the link status word. The statusJink flag is set. If 
multiple errors are encountered, only the first is reported. Note that the offset returned may not be 
the lowest offset which resulted in an exception. U no errors were present, STORE^RILOFFSET 
returns the next offset that the store stream would have issued to the network. This operation is 
used for diagnosing the failure of a store instruction. 



R-A.ISES 
StatusJink 



STOREXRR. 
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(STOREJLUSH) 82 00000000000000 I 

64 56 0 

Flush inbound data buffer 

The flush operation clears all data in the inbound data buffer. This operation should only be used 
when the HIPPI input stream is not receiving a packet. If the flush operation is used while the 
HIPPI input stream is receiving data, parts of the incoming data packet may be lost. 



RAISES 
(notlung) 



Memory Store Stream Operations 



STORE-FLUSH. 
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(STOREJNDJ>ACKET) 85 00000000000000 I 

Handle packet end clean up 

This operation flushes all data for the current packet that has not already been stored, finishing 
when end of packet is received. Thus, fill data at the end of a packet can be disposed of using 
this operation. At the extreme, a whole packet of data will be flushed if no STORE J)ATA or 
STOREJMAGE operations are placed between successive STORE_ENDJ?ACKET operations. 
In addition, the uncorrectable.error exception is raised if an uncorrectablejerror was detected during 
the reception of this packet by the in stream. Note that a STORE J:NDJ>ACKET must be issued 
for each packet received by the HIPPI IN stream. This operation always links on completion. 



RAISES 

uncorrectablejerror 



STORE j:nd_packet. 
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(STORE_END-SEGMENT) 




I 



64 



Validate partial packet data 

This operation ensures that the uncorrectable.error exception is raised if an uncorrectable^error 
was detected during the reception of any of the data for this packet which has already been stored. 
Thus, if the end of a burst has not yet been received, but the initial data in the burst ha5 been 
stored, this operation will wait until the LLRC check at the end of the burst has been performed. 
This operation always links on completion. 



RAISES 



unconeciablejerror 



Memory Store Stream Operations 



STORE-END-SEGMENT. 
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(STORE-DATA starUoffset end.offsei) a start.offset endLoffset I 

64 60 ^32 0 

Store data 

(STORXJMAGE start.offset en(Loffset) B starLoffset endLoffset I 

64 60 32 0 

Store image 

Store the 64-bit data and state beginning at the start^offset and continuing through all words up 
to end-offset minus one, inclusive. If an end of packet is signaled before all the indicated words 
are received and stored, the packet-end exception will be raised. The low-order three bits of the 
start j)ffset and endj^ffset are ignored. Thus, byte addresses may be used without need for shifting. 

The store stream interprets the endjoffset modulo 2^^. To include the last word in a segment, a 
lOOOOOOOie or 0 may be used as the end-oflfset. 

When -DATA is used, the access control states will be set to full, no forwarding, no traps. When 
JMAGE is used, the access control states are unpacked from the inbound buffer. When STORE.- 
IMAGE is used, the number of words to store to memory must be a multiple of 16. The store 
stream removes every 17th word and uses the data contained in it to generate the access control 
states for the preceding 16 words. An uncorrectable-error may be caused by failure of the inbound 
buffer or bad incoming HIPPI data. Note that these operations may be used together to scatter 
store a packet with a DATA header and IMAGE payload. 

These instructions will wait indefinitely for data to appear in the inbound buffer. Exceptions for 
the STORE JD ATA/IMAGE opcodes are uncorrectable-error, unimplemented-address, packet jend, 
or limit_error. The unimplemented address and limit«error exceptions should only occur due to 
program errors. 



RAISES 

uncorrectable-error. unimplemented-address. packet.end, limit.error 



STORE- 



I/O Operation Descriptions 



213 



(OUTJ-INK) 

Link Out stream 



64 



41 00000000000000 



0 



I 



(OUT-LINK-LOAD) 



48 OOOOOOOOOOOOOO 



I 



64 



0 



Link Load stream 

The OUT_LINK operation forces the HIPPI output stream to perform a link operation with the 
device driver. The link operation pair for the HIPPI output stream is located at word offset 2 and 
3 of the progrzLm segment for 10 P code. No exception is raised. 

The OUT-LINK-LOAD operation forces the load stream to perform a link operation. The current 
instruction of the load stream may be interrupted. The forced link bit will be set in the load 
stream's status word. 



HIPPI Out Stream Operations 



OUTXINK- 
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(OUTJEONG swap width timeout Ifield) J^^swap^l^^widthjimeoutjfield I 

Request a connection on the HIPPI interface 

The HIPPI request signal is asserted and the 32-bit I-field is placed on the HIPPI data path bits 
31 to 0. All zeros are placed on the remaining HIPPI data bits 63 to 32. The connect signal must 
imtiaUy be deasserted before request can be asserted. If the connect signal was asserted when the 
OUT_RING operation is executed a bad_connect exception is raised. 

This instruction waits until the connect signal is asserted by the destination or timeout occurs. 
Once the connect signal is asserted, the lOP stops transmission of the I-field. The stream then 
tnes to detect a rejected connection. If during this time a ready pulse is received, the stream 
assumes the connection was accepted. If the connect line is deasserted before a ready pulse is 
received, the connection was rejected and a connectJost exception is generated. 
The width field selects either a 32- or 64-bit channel width. When width is asserted, the lOP 
operates in 64-bit HIPPI mode. When width is deasserted, the order of the 32-bit words may be 
reversed by setting the swap bit. 

Exceptions for the OUT JUNG opcode are timeout, connectJost, interconnectJost or bad-connect. 



RAISES 

timeout, connectJost. interconnectJost, bad_connect 



OUTJUNG. 



I/O Operation Descriptions 



215 



(OUTJLOOPBACK timeoui) 4A timeout 00000000 I 

Request loopback connection 

The HIPPI-output stream indicates to the HIPPI-input stream that it wishes to be in loopback 
mode. If the HIPPI-input stream acknowledges with a similar IN-LOOPBACK command before 
timeout occurs. OUTJLOOPBACK mode is established. WTien the lOP is in loopback mode, 
any data and access control state present in the outbound buffer is available for transfer into the 
inbound buffer. Loopback mode continues until terniinated by the HIPPI-input stream or via 
LOADJ-INK.OUT. 

The exceptions for the OUTJ-OOPBACK opcode are timeout, connectJost, and forceJink. 



RAISES 

timeout, connectJost, forceJink 

HIPP! Out Stream Operations ' OUT-LOOPBACK- 
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(OUTJ,0OPMODE) 4, 000001 00000000 1 I 

Select local serial loop bade 

The HIPPI-output stream indicates serial Unk logic that it wishes to be in local serial loopback 
mode. This mode remains in effect until cleared by a subsequent OUTJLOOPBACK instruction. 



RAISES 
(nothing) 

OUTXOOPMODE. 
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(OUT-DISCONNECT timeout) 44 timeout 00000000 I 

Break connection 

This operation completes the HIPPI connection by deasserting the request signal and waiting for 
the destination to acknowledge by deasserting the connect signal. 

If a disconnect is issued while the lOP is in loopback mode, both the HIPPI input and output 
streams will exit loopback mode. 

The exception for the OUT_DISCOXNECT opcode is timeout. 



HIPPI Out Stream Operations 



OUTJ)ISCONNECT- 
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(OUTXANCEL timeout) 



45 timeout 00000000 



I 



64 



0 



Wait for late connect response 

Wait for the connect signal to be asserted or for timeout to occur. After a RING fails, this 
instruction may be used to wait up to a round trip delay for a late connect response to the previous 
request assertion. 

The exception for OUTXANCEL is timeout. 



IL\ISES 
timeout 



OUTXANCEL. 
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fOUTJNTERCONNECT width timeout) 27 width timeout 00000000 I 

64 I' S6 32 0 

Wait for interconnect asserted 

Wait for the interconnection lines for the specified HIPPI width to settle to an active state or for 
timeout to occur. 

The exception for the OUTJXTERCOXXECT opcode is timeout. 



RAISES 
timeout 

HIPPI Out Stream Operations ^ OUTJNTERCONNECT. 
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(OUT_DELAY timeout) 47 timeout 00000000 I 

64 56 33 0 

Wail 

Wait for timeout to occur, then fetch the next instruction. 
There are no exceptions for OUT_DELAY. 



RAISES 
(nothing) 



OUTJDELAY. 
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(OUT-RESET timeout) 47 timeout 00000001 I 

Send RESET to serial link 
Assert RESET to the serial link for the timeout period, then fetch the next instruction. 
There are no exceptions for OVT-RESET. 



RAISES 
(nothing) 

HIPPI Out Stream Operations ' OUT-RESET, 
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(OUT_PACKET timeout size) 
Send packet 



49 timeout size 



I 



64 



0 



This operation sends a packet containing a number of 256 word bursts, and one final short burst, 
if needed, out on the HIPPI interface. 

The number of eight-bit bytes to send is specified by the size field, which must be a multiple of 
eight {size is 8 » words). If LOADJMAGE is used, the number of words to fetch from memory 
must be a multiple of 16, and size is calculated by (17 * words)/2. 

The maximum packet length is 2^^ - 8 bytes. A zero length packet will cause the out stream to 
transmit an indeterminate length packet. A indeterminate length packet is completed by the LOAD 
stream issuing a LOAD-END JPACKET instruction. When the out stream receives the end packet 
indication from the load stream it empties the outbound buffer and then deasserts the packet signal. 
The instruction then completes and the next operation is fetched. Note that this handling only 
allows one indeterminate length packet to be present in the outbound buffer at a time. 

Output flow control is handled automatically by the lOP. The memory load stream produces data 
into the outbound data buffer and the output stream sends the data out as allowed by the ready 
signals from the destination. The out stream will wait indefinitely for data to appear in the 
outbound buffer. Errors in the buffer data will raise the uncorrectable^error exception. When such 
bad data is encountered, the LLRC of the burst will be corrupted to guarantee that the receiver 
discards the packet. 

During the OUT-PACKET operation, a LOAD-LINK.OUT operation will only allow any bursts 
already in the outbound buffer to be sent. If there are no more bursts present and the OUT-- 
PACKET operation has not completed, the packet will be truncated with a forced link exception. 
Note that this will cause a packet shorter than the encoded size to be transmitted. 

If the connection is lost, the operation is aborted and a connect Jost exception is raised. If the 
connection was not present before packet is asserted, then the no.connection exception will be 
raised instead of the connect Jost exception- The timeout field specifies the maximum time to wait 
for a ready pulse when a burst is ready to send, but the ready counter is zero. 

Exceptions for the OUTJPACKET opcode are timeout, uncorrectable-error, no_connection, or con- 
nect Jost. 



RAISES 

timeout, uncorrectable.error, no-connection, connect Jost 



OUT-PACKET- 
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(IN -LINK) 



CI 00000000000000 



I 



64 



0 



Link In stream 



(IN-LINK-STORE) 



CB 00000000000000 



I 



6« 



0 



Link Store stream 

The LN-LINK operation forces the HIP PI input stream to perform a link operation with the device 
driver. The link operation pair for the HIPPI input stream is located at word offset 6 and 7 of the 
program segment for lOP code. No exception is raised. 

The IN-LINK^TORE operation forces the store stream to perform a link operation. The current 
instruction of the store stream may be interrupted. If the store stream is executing a STORE.- 
DATA, STOREJMAGE, or STORE JIEPLICATE operation, it will delay the link operation until 
the in buffer is empty. The forced link bit will be set in the store stream's status word. 



HIPPI In Stream Operations • INXINK. 
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(INJLISTEN timeout) C5 timeout 00000000 I 

64 56 32 0 

Wait for connection request 

This operation waits until a connection is requested or a timeout occurs. The hardware continuously 
monitors the REQUEST signal, and registers a connection request when the signal transitions from 
deasserted to asserted. If a timeout occurs, the bad-connect exception is raised if BEQUEST is 
asserted, but has not transitioned to deasserted since the last disconnection. When BEQUEST is 
deasserted, timeout simply raises the timeout exception. 

If no exception occurs, the current value on the HIPPI input data pins is stored in the pc field 
of the link status word and the status Jink flag is set. The IN stream then waits for a new pc, 
completing a link operation. 

While an IN-LISTEN is waiting for a connection request, it will immediately abort with a forced 
link exception upon executing STORE-LINK JN in the STORE stream. 

The exceptions for the IN J-ISTEN opcode are timeout, bad.connect, or interconnectJost. 



RAISES 

timeout, bad.connect, interconnecJost, status Jink 
IN-LISTEN. 
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(INJIEJECT) C4 00000000000000 I 

Reject connection request 

This instruction asserts the connect signal, then deasserts it after eight HIPPI clock cycles. If the 
request signal is already deasserted. connect is not asserted and the connect Jost exception is raised. 

The exception to the INJIEJECT opcodes is connect Jost. 



RAISES 
connectJost 

HIPPI In Stream Operations - INJIEJECT. 
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(IN-ACCEPT swap width timeout) 6 swap 4 width timeout 00000000 I 

Accept connection request 

This instruction asserts the connect signal. K the request signal is deasserted, connect is not 
asserted and the connect Jost exception is raised. 

The width field indicates the channel width of the HIPPI in stream. When width is asserted, a 64- 
bit HIPPI channel is used. If no bursts are received within a timeout period, the timeout exception 
is raised. When width is deasserted, the order of the 32-bit words may be reversed by setting the 
swap bit. 

Input flow control is handled automatically by the lOP. A ready indication will only be signaled to 
the source when the lOP can guarantee buffer space (in the lOP or in memory) to hold the enabled 
burst. The in stream will wait indefinitely for free space in the inbound buffer. 

If a parity error or LLRC error is detected within a burst of this packet, then an error flag is 
associated with this burst, the connection is dropped (ending the packet), and the store stream will 
take an uncorrectable error exception after storing the data. 

K a burst is received with more than the maximum 256 words allowed, then an error flag is associated 
with this burst, the connection is dropped (ending the packet), and the store stream will take an 
uncorrectable error exception after storing the data. 

If a zero length packet is received, the connection is dropped and the connect Jost exception is raised. 
This instruction is only terminated via exceptions: connectJost, uncorrectable^error, timeout, or 
forceJink. In all cases, the connection is dropped when this instruction terminates. If a partial 
packet has been placed in the inbound buffer, a packet end is marked on an exception. [[Tbere is some 

race condition here that escapes xne.]] 

The exceptions to the IN-ACCEPT opcodes are connectJost, uncorrectable-error, and timeout. 



RAISES 

connectJost. uncorrcctable-error, timeout 



IN-ACCEPT- 
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(IN-LOOPBACK timeout) 



C3 timeout 00000000 



0 



I 



Enter loopback mode 

The HlPPI-input stream indicates to the HIPPI-output stream that it wishes to be in loopback 
mode. If the HIPPI-output stream acknowledges with a similar OUTJ.OOPBACK command 
before timeout occurs, loopback mode is established. When the lOP is in loopback mode, any data 
and access control state present in the outbound buffer is available for transfer into the inbound 
buffer. 

This operation is only terminated via exceptions. The connectJost exception is raised when the 
connection is terminated by the out stream executing OUT-DISCONNECT. In addition, IN-- 
LOOPBACK may be aborted with a STOREJ-INK JN operation in the store stream. 

The exceptions for the INJLOOPBACK opcode are timeout, and connectJost. 



RAISES 

timeout, connectJost 
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(INJNTERCONNECT width timeout) 67 xvidth timeout 00000000 I 

' 64 57 46 32 0 

Wait for interconnect asserted 

Wait for the interconnection lines for the specified HIPPI width to settle to an active state or for 
timeout to occur. 

The exception for the INJNTERCONNECT opcode is timeout. 



RAISES 
timeout 



INJNTERCONNECT- 



I/O Operation Descriptions 

(IN-DELAY timeout) 
W^aii 

Wait for timeout to occur, then fetch the next instruction 
There are no exceptions for the IX-DELAY opcode. 
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C7 timeout 00000000 I 

64 56 32 0 



RAISES 
(nothing) 
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Chapter 15: I/O Processor Examples 



15.1 Loading Memory 

The following code fragment may be used as a template for loading a segment of memory into the 
outbound data buffer. If an error occurs during the loading of data from the network, or the HiPPI 
out stream encounters an exception during the output command for this load, the driver will be 
notified by the link address. The notification will indicate that the pc-off set for the load stream 
is at offset 1 relative to the start of the code fragment, and the exception bit in the SSW will be 
set, along with the error bit that caused the exception. 

<Loa(LSegment> = 

(LOAD^SEGMENT dist^en memj unit limit base) 

(LOAD.DATA start.off end.off ) 

(LOAD.LINK) 



The following code fragment loads a header of upper layer protocol in unpacked format into the 
outbound buffer, and then loads the actual data in an image format into the outbound buffer. 

< LoadJlmage > = 

iSegment for ULP data 

(LOAD.SEGMENT ULPdist.en ULPmemj ULPunit ULPlimit ULPba^e) 

(LOAD.DATA ULPstart ULPend) 
'Segment for Dataset s 

(LOAD.SEGMENT dist.en memj unit limit base) 

(LOADJMAGE start.off end.off) 

(LOAD.LINK) 
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15.2 Sending Data 

The following code fragment may be used as a template for making a 64 bit connection to the 
external HiPPI domain. When the link operation occurs, the driver must look at the exception bit 
in the SSW to determine whether the operation succeeded or no external device responded to the 
connection request. 

<MQktJting>^ 

(OUT.RING 1 1000 Ifield) 
(OUT.LINK) 



If the Make Jling code fragment fails on the RING instruction then the lOP must allow for a spurious 
connect signal. The following code fragment waits for an entire connect pulse to occur or for two 
milliseconds, which ever occurs first. 

< Break JtiTig>^ 

(OUT.CANCEL 2000) 
(OUT.DISCONNECT 100) 
(OUT.LINK) 



The following code fragment may be used to transfer a packet on the HiPPI channel. The number 

of words loaded by the load stream is length. If the load stream used LOAD..IMAGE, the length must 
be multiplied by 17/16 in order to compensate for the extra state words packed into the outbound 
buffer. 

< Output J*acket> = 

(OUT.PACKET 100 length) 

(OUT_LINK) 



15.3 Receiving Data 

The following code fragment returns an I- field if an external device tries to connect with the lOP 
before a timeout occurs. The Ifield will be stored in the pc field of the status link word. 

< List€n^orJlequest> = 

(IN.INTERCONNECT 10) 
(IN.LISTEK 10000) 
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The following code fragment receives a sequence of packets from the HiPPI port. The connection 
is normally terminated by the sender, but can be broken by a STOREJLINKJN operation. 

< Receive JPackets> = 



(IN.ACCEPT 1 100) 



15.4 Storing Memory 

The following code fragment stores the upper layer protocol header in one area and the data 
information in an I/O buffer. 

< ReceiveJmage> = 
;ULP data area 

(STORE.SEGMENT ULPdist.en ULPmemJ ULPunit ULPUmit ULPbase) 
(STORE.DATA ULPstart ULPend) 
;File system 10 area 5 
(STORE.SEGMENT dist.en mem_t unit limit base) 
(STORE.IMAGE start.off end.off) 
(STORE.LINK) 



< iopexamples.asm>^ 

< Loa<LSegment> 
<LoacLImage> 
<MakeJiing> 
<BreakJLing> 
<Output.Packet> 

< Listen J'orJlequest> 

< Received acketsy 

< Receivedmage> 
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Appendix A: Operation Encoding Summary 



This chapter shows the encoding for every operation. The first column contains 64 sub columns, 
one for every bit in an instruction word, numbered from right to left. The second column is the 
assembly language prototype. Within the first column, a symbol indicates that the particular 
bit is not used. A symbol "0" or "1" indicates a literal value encoding the operation. An sdphabetic 
symbol shows where bits encoding an operand occur. If the operand name is ''xyz", then the first 
letter ''x'' is repeated to fill the object code field. A indicates an operand w^hich is not used by 
the particular operation (don't care). 



A.l M OPs 

rrrrrsssssOOOO (LOADJT r s) 

— rrrrrsssssOOOl (Iirrj.01DH r s) 

riTTTSssssOOlO — — (IHT_LOADO r s) 

rrrrrsasssOOn (IHTJLOADB r s) 

— rrrrrsssssOlOO (LOAD r s) 

rrrrrssssfiOlOl (UHSJ-OADH r s) 

rrxTrsssssOllO (UHSJ.OJLDQ r s) 

00000000000110 (HOP) 

rrrrrsssssOin (UHSJLOADB r s) 

rrrrrssaaslOOO- ' (STORE r s) 

ixxii sssaslOOl- (STOREH r s) 

rmTssssslOlO — ■ ■ (STOREQ r s) 

— mrrssssslOll— — ( STORES r s) 



A.2 MC OPs 



-mTTSftsss 1100- 
-rrrrrsssss 1 100- 



-nrrrsssss 1100- 
-rrrrxsssss 1 100- 



- iiiixs ssss 1 100- 
-rrxTTSssss 1 1 00- 



-rrrrrassss 1 100- 
-rrrrrssaasllOO- 
-rmxaaaaa 1 1 00- 
-rrrrraasss 1 1 00- 



-rrrrraasasllOl- 
-rrrrrsaaasllOl- 
-rrxTTssBas 1 10 1 - 
-rrrrrsasasllOl- 
-rrrrrassss 1 1 0 1 - 
- rrrrr s s 8 a s 1 1 0 1 - 
-rrrrrs as s s 1 1 0 1 - 
-rmTsasaallOl- 
-rmxsasssllOl- 
-rrrrrsssasl 101- 
-rxrrrssssslllO- 



-dddddddddddddddddddlO 
-aaaaaddddddddddddddl 1 
-ddddddddddddddddddlOO 
-aa&aadddddddddddddlO 1 

-aaaaaddddddddddddlOOl 
-ddddddddddddddddlOOOO 
-aaaaadddddddddddl 0001 
-ddddddddddddddddOOOOO 
-aaaaadddddddddddOOOO 1 
-dddddddddddddddddddlO 
-aaaaaddddddddddddddl 1 
idddddlOO 
ddddddlOl 
-dddddddddddddddddlOOO 
-aaaaaddddddddddddlOOl 
-ddddddddddddddddl 0000 
-aaaaaddddddddddd 1 000 1 
-ddddddddddddddddOOOOO 
-aaaaadddddddddddOOOOl 



(Ijrrj.OADBJ)ISP r a diap) 
(IHTJ.OADBJlCJ)ISP r s ac disp) 
(irrj.Dia)QJ)ISP r & dlsp) 
(IHTJ.OJLDQJLCJ)ISP r s ac disp) 
(IBTJLOADOISP r 8 disp) 
(IHT-LOADHJlCJ)ISP r s ac disp) 
(IHTJXTCHJLDD-DISP r a disp) 
(irrjrrCEJLDDJlC-DISP r a ac disp) 
(STATL10CKJ)ISP r s disp) 
(STiT£JLOCKJLCJ)ISP r a ac disp) 
(UHSJ.OJU)BJ)ISP r a diap) 
(UHSJ.OADBJlCJ)ISP r s ac disp) 
(UHS-LOiDQJ}ISP r s disp) 
(UHSJ.OAI)QJLCJDISP r a ac disp) 
(UHSJ.OADRJ)I$P r s diap) 
(UHSJ-OADHaC-DISP r s ac disp) 
(LOADJ)ISP r a disp) 
(LOADJLCJ)ISP r s ac disp) 
( REG-LOAD J)ISP r a diap) 
(REGJ.OADaCJ)ISP r a ac disp) 
(STOREB-DISP r s disp) 
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-rrrrrsssssUlO- 
-rrrrrsssssUlO- 

-nTTTSSSSSlllO- 

-rmrssssslllO- 



-rrrrrsssss 11 1 0- 



-rrrrrssssslllO- 
-mrrsassslllO- 



-rmrssssslllO 

-rrrrrsssss 11 10 

-rrrrrsssss 1110— 
-rrrrr**M • 1 1 1 1 — — 



— aaaaaddddddddddddddl 1 
— dddddddddddddddddd 100 
— aaaaadddddddddddddlO 1 
— ddddddddddddddddd 1 000 
— aaaaaddddddddddddlOO 1 
— ddddddddddddddddl 0000 
— aaaaadddddddddddlOOOl 
— ddddddddddddddddOOOOO 
—OOOOOdddddddddddOOOO 1 
— aaaaadddddddddddOOOO 1 
— xxxxxyyyyyoooooOO 1110 



(STOREBJlCJ)ISP r s ac disp) 
(STOR£QJ)ISP r s disp) 
(STDREQJIC-DISP r s ac disp) 
(STOR£HJ)ISP r s disp) 
($TOR£H ACJ)ISP r s ac disp) 
(STOREJ)ISP r s disp) 
(STOR£JlCJ)ISP r 3 ac disp) 
(STATE-STOR£J)ISP r s disp) 
(STATE^TORELERKOILDISP r s disp) 
(STATELSTOREJIC-DISP r s ac disp) 
(STREAM-CREATE-Ml r t u x y oiUex) 



-••••••••••1111 

-••••••••••1111 

— rrrrrsssss 1 1 1 1 

-rrrrrsssssllll 

-•••••sssssllll 

-•••••sssssllll 

-•••••sssssllll 

-rrrrr»»*««iiii 

-rrrrr»»»»«llli 



dddddddddddddddd 10010 

aaaaadddddddddddl 00 1 1 

••••••••••00010000010 

••««ft«»««,000 1 1000010 

••••••••••00 1 000000 1 0 



-rrrrr*«^**llii 

-rrrrrsssssllll 

-rrrrrsssssllll 

-rrrrrsssss 111 l 

-rrrrrsssssllll 



-rrrrrsssss 111 i — 

-rrrrrsssss in i— — 

-rrrrrsssssllll 

-rrrrrsssssllll 



— rrrrrsssss 1111— 
— rrrrrsssss 111 i— 
— rrrrrsssss 1111— 



-rrrrrsssss 1111 

-rrrrrsssss 1111 — 

-rrrrrsssssllll 



rrrrrsssssllll 

rrrrrsssss 111 1 

— rrrrrsssss 111 1 

—rrrrrsssssllll 

—rrrrrsssss 1111 

— rrriTsssss 111 i 

rrrrrsssssllll 

—rrrrrsssss 111 i 

rrrrrsssss 1111 

— rrrrrsssss 111 1 

— rrrrrsssss ill l 

—rrrrrsssss ill l 

— rrrrrsssss ill l — 

—rrrrrsssssllll 

— rrrrrsssssllll 

— rrrrrsssssllll 

— rrrrrsssssllll 

— rrrrrsssssllll 

rrrrrsssssllll 

— rrrrrsssssllll 

— rrrrrsssss nil 

rrrrrsssssllll 

— -rrrrr ssss s 1 1 1 1 « 



— xxxxx^sssssssddOO 1100 
—••••••••••OOoooOOOlOO 

— ••••••••••OloooOOOlOO 

—••••••»••• 10000000 100 

— •••••yyyyyOOOOO 1 1 1000 

—•••••yyyyy 0000 1111000 

— •••••yyyyyOOOlOlllOOO 
— •••••yyyyyO lOOOl 1 1000 
— •••••yyyyyO 1001 1 1 lOOO 
— •••••yyyyy 0 10 1011 1000 

—•••*»yyyyy 0 1 1 00 1 1 1 000 
—•••••yyyyyOll 10111000 

""•••••yyyyy i oooo i i i ooo 

--•••••yyyyylOOOllllOOO 

— •••••yyyyy i 00 i o i i i ooo 

— •••••yyyyy 1 00 1 1 1 1 1000 

— ••••.yyyyylOlOOlllOOO 

— •••••yyyyy 101 101 1 1000 
—•••••yyyyy 1 1000 1 1 lOOO 
— •••••yyyyyllOOllllOOO 
— •••••yyyyy 110 10 11 1000 
— •••••yyyyy 1 1 lOQll 1000 
—•••••yyyyy 1 11 10 1 1 1000 

— Ua*^yyyyy00000 1 1 1 00 1 
— aaaaayyyyy 0000 1 1 1 100 1 

ayyryyo 1000 111001 

ayyyyyOlOO 11 11001 
— aaaaayyyyyO 10 10 1 1 1 00 1 
— aaaaavyyyyOl 1001 11001 
— aaaaayyyyyO 1 1 1 0 1 1 100 1 
— aaaaayyyyy 100001 11001 
— aaaaayyyyy lOOOllllOOl 
— aaaaayyyyy 1 00 1 0 1 1 1 00 1 
—aaaaayyyyy 100 1 1 1 1 100 1 
— aaaaayyyyy 10 1 00 1 1 100 1 
—aaaaayyyyy 10 1 10 1 1 1 00 1 
— aaaaayyyyy 1 l 000 1 1 100 1 
— OOOOOyyyyy 1 1 00 1 1 1 1 00 1 
—aaaaayyyyy 11001 1 1 100 1 
—aaaaayyyyy llOlOlllOOl 



•OOOOOOOlOlO (STREAMJQUIT) 
•00001001010 (STREllUQUIT-PRESERVE) 

(IHTJIEKJLDD-DISP r s disp) 
(IKTJIDUIDDJIC-DISP r s ac disp) 
(DATJLMAP-FLUSH s) 
(DATAJUP-FLUSHJUnr s) 
(DATA-STATE JIESTORE s) 
(STREAM.CATCH r t x delay str) 
(DATAJ}PA-SAV£ r opno) 
(OATAJIPD^AVI r opno) 
( DATA-OP JIEDO r s) 
(STATELLOADJNDEX r s y) 
(REGJSTORE-IHDEX r s y) 
(STATEJSCRUBUHDEX r s y) 
(LOADJCNDEX r s y) 
(REGa.OADJHDEX r s y) 
(UHS_LOADHJroEX r s y) 
(UNS-LOADQ-IHDEX r s y) 
(UHSJ.OADB.IHDEX r s y) 
(IHTJFETCHaDDJFDEX r s y) 
(STATE-LOCKJNDEX r s y) 
(IirrXOADHaHDEX r s y) 
(IHTJEMJIDDJNDEX r s y) 
(IHTJ.OADQ-IHDEX r s y) 
(IHT-LOADB^VDEX r s y) 
(STOREaiTOEX r s y) 
(STATE-STORE-IHDEX r s y) 
(STOREHaHDEX r s y) 
(STOREQJHDEI r s y) 
(STOREBJHDEX r s y) 
(PROBE-IHDEI r s lev access y) 
(REG^TOREJlC-IHfDEX r s ac y) 
(LOADJLC-IHDa r s ac y) 
(REGJ.0AOJLCUHDEI r s ac y) 
(UNSJ.0AOUJLCUHDEX r s ac y) 
(UNS_L0ADQJICJ[1IDEX r s ac y) 
(UKS-LOADBJlC-iroEX r s ac y) 
(IHT-FETCH-ADDJICJHDEX r s ac y) 
(STATE-LOCKJlC-INDa r s ac y) 
(IKTaOADHJlCUlIDEX r s ac y) 
(IKTJ!EMJLDDJlCJirDEX r s ac y) 
(IHTJ.OADQJlCaHDEX r s ac y) 
(INT-LOADBJICUHDEX r s ac y) 
(STOREJICUNDEX r s ac y) 
(STATEJSTORE-ERROfLiroEX r s y) 
(STATE-STOR£JlC_IHDEX r s ac y) 
(STOREH-ACaHDEX r s ac y) 
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— rriTrsssssllll aaaaayyyyy 1110011 1001 (STOREQJIC JHDEX r s ac y) 

rrrrrssssslin aaaaayyyyy U 1101 11001 (STOREB -AC .INDEX r s ac y) 

— rrrrrssssslin ddddddddddddddddOlOOO ( STATE-SCRUB J)ISP r s disp) 

— rrrrrssssslin dddddddddddddddd 10000 (STATEaOADJ)ISP r s disp) 

-—rrrrrssssslin lla*«dddddddddddl0001 (PROBE_DISP r s lev access disp) 

rrrrrssssslin ddddddddddddddddOOOOO ( REG -S TORE J)ISP r s disp) 

— rrrrrssssslin aaaaadddddddddddOOOOl (REGJSTOREJICJJISP r s ac disp) 



A.3 A OPs 



•OOOOl* 



•000000- 



-•••••00100*»*««00000000000 

-•••••00100»**»*0000l0O0000 

-tttttOOl 10»»*»»dd000000000 

-tttttOlOOO»*««»00000000000 

-ttttt01000"*»«»0000 1000000 

-ttttt0l000«»"»«000 10000000 

-ttttt0l000»*«»«00011000000 

-ttttt0l001wwv***«*000000 

-tttttOlOlOnamaBamnccOOOOOO 

-tttttOlOllmmaamfflBmccOOOOOO 



-tttttOllttttttbbbbbbOOOOOO- 
-ttttt 11000vwTV¥¥wvwOOOOOO- 

-tttttllOlOvTVVVW¥¥¥000000- 



ttttt lnOOvvvvwww¥000000— 



-ttttt 11 1 lO wvvvwvwww OOOOOO 

-tttttOlOlOmmBmaaBaccOOOOOl 

-tttttOlOllmmBBOBmBCcOOOOOl 

-tttttllOlOvTWwwBwwOOOOOl— — 

-tttttvvTVVVVVVvvvvvvOOOOlO 

-000000000000000000000000 10 

-tttttimuuuTvvwiiiccOOOlOO™ 
-tttttuuuuuvTvwiiiccOOOlOl— ~- 



(BREAi) 
(STREAKJ3UIT) 
(STREAMJOUIT-PRESERVE) 
(STREAM.CATCH r t x delay str) 
(D0MAIODEHTIFIEJL3AVE t) 
(STREAKaDENTIFIEILSAVE t) 
(STREAILCUR_SAVE t) 
(STREAMJIES-SAVE t) 
(REGJIOVE t v) 
(LOGICAL-DKE t mask cn) 
(LOGICALJLLLOHE t aasJc cn) 
(BITJIASK t top bot) 
(FLOAT-SCALE t v w) 
(IRT-SHIFTJIICHT t v y) 
(IRTJIECIP.ERROR t v v) 
(FLOATJIECIP-ERROR t v ») 
(L0GICALJ3irE.TEST t mask cn) 
(LOGICALJILLOVE-TEST t wk cn) 
(IHT^HirrjlIGHT.TEST t v w) 
(IHTOim t value) 
(NOP) 

(SELECT-IHT t u v intselect cn) 
(SELECTJNT.TEST t u v intselect 



cn) 



-tttttUUUUUYWVTf f f ccOOOl 10- 



( SELECT J'LOAT t u v float select cn) 



-tttttuauuunnnnrf f f ccOOOl 11- 



(SELECT^LOATJIEST t u v lloatselect 



-tttttuuuuuOOssssssssOOlOOO- 
-tttttuuuuuO 1 ssssssssOO 1 000- 
-tttttizuuuulOOOssssssOOlOOO- 
-tt tttuuuuul 1 0000000000 1 000- 
-tttttuuuuul 100000001001000- 
-tttttuuuuull 000000 1000 1000- 
-tttttuuuuul 1 000000 11001 000- 
- tt tt tuuuuu 1100000100001000- 
-tttttuuuuul 10000010100 1000- 
-tttttumiuul 1 00000 1 1000 1 000- 
-tttttuuuuullOOOOOl 11001000- 
-tttttuuuuullOOOOlOOOOOlOOO- 



-tttttuuuuu 11 0000 100 1001 000- 
-tttttuuuuul 100001010001000- 
-tttttuuuuullOOOOlOl^ 001000- 
-tttttuuuuul 100001 1 0000 1000- 
-tttttuuuuul 1 0000 1 1 0 1 00 1 000- 
-tttttuuuuul 1 0000 1110001 000- 



(STREAK-RESERVE t u st) 
(STREAKJIESERVE-UPTO t u 
(SEIFTJXFT-Ilffl t u sh) 
(FLOATJEAR t u) 
(FLDAT.CHOP t u) 
(FLOATJ-LOOR t u) 
(FLOAT-CEIL t u) 
(IHT-HEAR t u) 
(IBT-CHOP t u) 
(IHT-FLOOR t u) 
(IHT.CEIL t u) 
(UHSJfEAR t u) 

(trjs.cHOP t u) 

(UHS-FLOOR t u) 
(UHS^CHL t u) 
(FLOATJUJUHD t u) 
(IHTJlOUro t u) 
(UBSJIOUED t u) 



St) 
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tttttuuuuul 1000 11 100001000 

tttttuuuuul 1000 1 1 1 1000 1000 

tttttuuuuul 10001 1 1 11001000 

tttttuuuuul 10 lOaaaaaOO 1000 

tttttuuuuul 101 100000001000 

ttttt*««*» 1101 1 10000001000 

-ttttt««»«110l 110001001000 

-ttttt***** 1101110010001000 

-ttttt*«»»« 1101 110011001000 

-ttttt»"»»110lllOUe001000 

=ttttt»— 11011 1 1000001000 

=ttttt»»»«»1101111001001000 

-ttttt 1 101 11 1100001000 

=tttttuuuuu00ss33ssss001001 — — 

-tttttuuuuuOlssssssssOOlOOl 

tttttuuuuulOOOssssssOOlOOl— ~ 

-tttttuuuuul 100000100001001 

^tttttuuuuul 100000101001001 

-tttttuuuuul 100000 110001001 

=tttt tuuuuul 100000 1 1 100 100 1 

= 1 1 1 ttuuuuu 1 1 0000 1 00000 1 00 1 

^tttttuuuuul lOOOOlOOlOOlOOl 

=tttttuuuuull00001010001001 

-tttttuuuuullOOOOlOllOOXOOl 

-tttttuuuuullOOOOl 101001001 

tttttuuuuul 100001 110001001 

=«**00uuuuu««*»*00000001010 

•••00uuuuu»»«*»00010001010 

3««oOuuuuu«««*«00011001010— 
•••00uuuuu***««001000010l0— 

•••00uuuuu«*»*«0010l001010 

••*00uuuuu****»00 1 1 10010 10 

•••00uuuuu»»***0l001001010 

•••00uuuuu*««»0l01000l010 

^-•00xmuuu»»««»0ll00001010 

••«00*«««««»tt««01 101001010 

•••OOuuuuu***** 10000001010 — — 

ttt01uuuuu»»«*»«»*«*001010 

ttt lOoooooooooooooooOOlOlO'— — 

ttt 1 luuuuuOOOOOOOOOOOO 1010 

tttttuuuuuooooooooooOOlOll-— - 



■ ( FLOAT JIEAL t u) 
" (FLOAtaNT t u) 

- (FLOAT-UNS t u) 

■ (PTJLSETJIC t u ac) 

- (BITJIAT.TRANSPOSE t u) 

■ (COUNT-ISSUES t) 

• (COUNT JIEMREFS t) 
' (COUNTJSTREAMS t) 

(COUNT-CONCURRENCY t) 

(COUNT-EVEirrS t ec) 

(CGUirr-PHAHTOKS t) 

(COUNT-READY t) 

( COUNT-SELECT J5AVE t) 

(STREAMJIESERVE-TEST t u at) 

(STREAHJIESERVE.UPTO-TEST t u st) 

(SHin-LEFTjm-TEST t u ah) 

(INT-NEAR-TEST t u) 

(IHT.CHOP.TEST t u) 

(IHT-FL0OR.TEST t u) 

(IHT-CEIL-TEST t u) 

(UHS-NEAR.TEST t u) 

(UHS-CHOP-TEST t u) 

(UHS^LOOR-TEST t u) 

(UHS-CEIL-TEST t u) 

(IHT-ROUHD-TEST t u) 

(UHSJIOUND-TEST t u) 

(PROGRAM-STATE-RESTORE u) 

(PROGRAMJUP-FLUSH u) 

(PROGRANJlAP-FLUSHJUrr u) 

(PROGRAK-CACHE-FLUSH u) 

(PROGRAM-CACHE-PLUSH-Ll u) 

(PROGRAJl-CACHE-FLUSH-ANY u) 

(EXCEPTIQNJIESTORE u) 

(SSVJIESTORE u) 

(DOMAIN-LEAVE u) 

(DOMAiNjarrER) 

(COUNT-SELECT-RESTORE u) 
(TARGET-RESTORE tn u) 
(TARGETJ)ISP tn ollset) 
(TARGET-INDEX tn u) 
(STREAM-CREATE-IMM r t u x y offset) 



tttttuuuuuvwwOOOOOOOllOO- 
ttt ttuuuuurTvwOOOO 100 1 1 00- 
t ttttuuuuu wwvOOO 1 000 1 1 00- 
tttttuuuuuvvwvOOOllOOllOO- 
tttttuuuuuvwwOOlOOOOllOO- 
ttt t tuuuuuvYvwOO 1 0 1 00 1 1 00- 
tttttuuuuuwwvOO 1100011 00- 



tttttuuuuuvwwOOl 1 1001 100 

tttttuuuuuwwvO 100000 1100 

tttttuuuuuwvwOlOOlOOllOO 

tttttuuuuuwvTvOlOlOOOllOO 

tttttuuuuuvwwOlOllOOllOO 

ttt ttuuuuuOOOOOO 1111001 100 

tttttuuuuuvww 10000001 100 

tttttuuuuuvwwlOOOlOOllOO 

tttttuuuuuvww 10 10000 1 100— 
1 1 1 1 tuuuuuwvw lOlOlOOll 00-—— 



(BITJIHP t u v) 
(BIT-AHD t u ▼) 

(BIT-XOR t u v) 
(BITJDR t u v) 
(BITJIOR t u v) 
(BITJCNOR t u v) 
(BIT-NAHD t u v) 
(BIT-IMP t u v) 
(BITJ]DD-NIMP t u v) 
(BIT-ODDJOfD t u V) 
(BIT-DDDJCOR t u v) 
(BITJJDDJDR t u v) 
(BIT.TALLY t u) 
(BITJUT-DR t u v) 
(BIT-MATJCGR t u v) 
(BITJ>ACK t u v) 
(BIT.UNPACK.1 t u v) 
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tttttuuuuuvvvvvlO 110001 100 

-t ttttuuuuuvvvvv 101110011 00 — 

tttttuuuuuvvvvvOOOOOOO 1101 

— 1 ttttuuuuuvvvvvOOOO 1 00 11 0 1 

tttttuuuuuvvvvvOOO 1000 1101 

tttttuuuuuvvvvvOOO 11001101 

ttt ttuuuuuvvwvOO 1 0000 1101 

ttt ttuuuuuwvwOO 1 0 1 00 1 1 0 1 — — 

tttttuuuuuwvwOOllOOOUOl 

tttttuuuuuwvwOOl 1100 1101 

——— —ttt ttuuuuuvwwO 100000 1 101— — — — 

— — —ttt ttuuuuuvwwO lOOlOOllOi — ~ 

1 tt 1 1 uuuuu wvwO 1010001101 

ttt t tuuuuu w vwO lOllOOllOl 



1 tt t tuuuuuOOOOOO 1111001101 

™™— — ttt ttuuuuuvvvw 1 000000 111 0— — 

-—— 1 ttt t uuuuu wwv 10001001110 

— ttt ttuuuuuvvwv 1001000111 0 

1 ttt tuuuuu WW V 10011001110 

tttttuuuuuwvw 10 10000 1110 



ttt ttuuuuuvvvw lOlOlOOlllO- 

ttttt**»»»wvwl 100100 1110- 

— tttttuuuuuwvwl 1 1000011 10- 

tttttuuuuuwvw 11101001110- 

tttttuuuuuwvwl 1 1 1000 1 1 10- 



-tttttuuuuuwvw 1111100111 0 

-tttttuuuuuwvwlOOOlOOllll 

-ttt ttuuuuuvvvw 100 1 000 1111 

-tttttuuuuuwvwlOOl 1001 111 

-tttttnuuuuvwwlOlOOOOl 1 11 



— tttttnuuuuvwwlOlOlOOllll- 
— ttt ttuuuuu vvvw 111 0000 1111- 
— tttttuuuuuwvwl 1101001111- 
— tttttuuuuuwvwl 1 1 1000 1111- 
— ttt ttuuuuuvvvw 11111001111- 
— ttt ttuuuuuvvvw w w w w O 101 00- 
— tttttuuuuuwvwvwwO 10101- 

— tttttUUUUU WVVVWWWtfW O 10110- 

— tttttuuuuuwwwwwOl 1000- 
— 1 1 1 1 1 uuuuuvwww vwO 11001- 
— tttttuuuuuwwvwwvO 1 10 10- 
— tttttUUUUU W V WVWIIWWW O 11011- 
011100- 
011101- 
011110- 
011111- 




-tttttuimun^ 

-tttttUUttUU^ 
-tttttUUUUU' 
-tttttUUUUU' 

-tttttuuttuuOvwwwwl 00000- 



•tttttuuuuul^ 



nooooo- 



-tttttuuuuuO vvvvv WW 1 0000 1 — 
- 1 1 1 1 1 uuuuu 1 vwwww 1 0 0 0 0 1 — 
-tttttnuuuuwwvwwv¥lC0110— 

-tttttUttUUU WVVVWWWWW 100111 — 
-ttt t tuuuuu VVWVWW¥ 101 000— 
-tttttuuuuuv wvvwuwww 1 0 1 00 1 — 
-ttt t tuuuuuwwwww 1 0 1 0 1 0— 
-tttttuxiuuu vvvvvwyywu 1 0 1 0 1 1 — 
-tt t ttUUUUU WVWW W¥ 1 0 1 1 0 0— 
-ttt ttuuuuuv w wwywww lO 1101— 



(BIT.UNPACK-2 t u v) 

(BIT.UNPACK.3 t u v) 

(BIT JIIKP .TEST t u v) 

(BIT-AND.TEST t u v) 

(BIT-XDR.TEST t u v) 

(BIT-DR-TEST t u v) 

— (BITJIOR.TEST t u v) 

— (BITJCKOR.TEST t u v) 
(BITJiAND.TEST t u v) 

— (BIT-IMP.TEST t u v) 

— (BITjDDDJilMP-TEST t u v) 

— (BITJ3DDJIND.TEST t u v) 

— (BIT-DDD-XOR.TEST t u v) 

— (BITJ)DD-OR-T£ST t u v) 

— (BIT.TALLY.TEST t u) 

— (FLOAT-ADD t u v) 

— (FLOAT^UB t u v) 

— (FLOAT-MIN t u v) 
~ (FLOATJIU t u v) 
~ (FLOAT-MMIH t u v) 

— (FLOATJMAX t u v) 

— (REAL-FLOAT t v) 

— (IHT-ADD t u v) 
~ (IHT-SUB t u v) 
~ (IHTJIIN t u v) 

— (IHTJIAI t u v) 

~ (FLOAT-CMP.TEST t u v) 

— ( FLOAT JIIH.TEST t u v) 
~ (FL0ATJ1AX.TEST t u v) 

— (FLOATJMH-TEST t u v) 

— ( FLOAT JMAJUTEST t u v) 

— (IHT-ADD.TEST t u v) 

— (IHTJSUB.TEST t u v) 

— (IKT-MIH.TEST t u v) 

— (irrJlAI.TEST t u v) 

— (SHIFT-PAIR-LEFT t u v v) 

— (SHIFT-PAIRJ£FT.TEST t u v w) 

— (SHIFT-PAIRJIIGBT t u v ¥) 

— (SHIFT-PAIILRIGHT-TEST t u v w) 

— (IHTJ)IV.CHOP t u V w) 

— (IHTJ)IV-CHOP.TEST t u v v) 

— (IHT-DIV-FLOOR t u v ») 

— (IHTJIVJ'LOOILTEST t u v ») 

— (UHSJ)IV t u V f) 

— (UHS-DIV-TEST t u v w) 

— (FLOAT-DIV t u v w) 

— (FLOAT-SORT t u v w) 

— (IHTJIDD-Iim t u value) 

— (IKT-SUB«IMM t u value) 

— (IHTJLDD-IMM-TEST t u value) 

— (IHT-SITB-IMLIEST t u value) 

— (BIT-MERGE t u v w) 

— (BIT-KERGE-TEST t u v w) 

— (IHTJa)D-KUL t u v b) 

— (IKTJLDDJIUUTEST t u v w) 

— CIHT_SUBJ!UL t u v f) 

— (IKT-SUBJIUL-TEST t u v w) 

— (UHSJIDD-MUL-UPPER t u v f) 

— (UNS-ADDjnnjJPPER-TEST t u v w) 
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-tttttuuuuuvwwvwirovlOlllO (IBT^UB-MULJIEV t u v w) 



tttttuuuuuwvwv¥wl0111i (IHTJSUBJWLJIEV.TEST t u v w) 

tttttuuuuuvwwpwwilOOOO (FLOATJlDDJfUL t u v y) 

-Cttttuuuuuvwwirewwwl 10010 (FLOAT-SUB-MUL t u v v) 

tttttuuuuTivwwvff^rvl 10100 ( FLOAT JflJULOVER t u v w) 

ttttttmuinirrnnnnniOTBllOllO (FLOAT-SUB-KULJIEV t u v 

tttttuuuuu Yww tf www w w l 11000 (FLOATJTER t u v ») 

tttttuuuuuvuvvvywuwwlllOlO (FL0ATJ)IVJLPPROX t u v w) 

tttttuuuuuwwwwnrlUOU (FLOAT-SQRTJLPPROX-TEST t u v 

tttttuuuuuTTwwvOTvllllOO (FLOAT-DIVJMOR t u v ») 

ttttttnmttuwywyinrTPUnoi (FLOAT^QRT-IRRfllLTEST t u t ») 

tttttuuuuuvwvTrOTwy»iiiiii (FLOATJlSqRT^OR.TEST t u v 



A.4 C OPs 



.-•••••yyyyyOOOOlOOOOOO ( FLOAT JIPPROXJIESTORE y) 
-xxxxxyyyyyOOO 10000000 (REGJIOVE x y) 
-xxxxxtttttOOOllOOOOOO (TRAP^AVE x tr) 
-xxxxxyyyyyOO 100000000 (BIT-LEFT-OHES x y) 
-xxxxxyyyyyOOlOlOOOOOO (BIT-IXFT-2ER0S x y) 
-xxxxxyyyyyOO 110000000 (BITJUGHT-DJTES x y) 
-xxxxxyyyyyOO 111000000 (BITJUGHT-ZEROS x y> 
-xxxxxyyyyy 01000000000 (IHT-RECIPJIPPROX x y) 
-xxxxxyyyyyOlOOlOOOOOO (IHT-RSQRTJIPPROX x y) 
-xxxxxyyyyyO 10 10000000 (STREAILCOUUT-IHSTJIESTORE x y) 
-xxxxxyyyyyOlO 11000000 (IBT-LOGB x y) 
-xxxxxyyyyyO 1100000000 (FLOATJIECIPJIPPROX x y) 
-xxxsxyyyyyOl 101000000 ( FLOAT JISQRTJIPPROX x y) 



rxxxxyyyyyOl 110000000 (IHTJIECIP^HIFT x y) 

xxxxxyyyyyOl 111000000 (UHSJIECIP-SHIFT x y) 

xxxxx«**« 11000000000 (STREAJl.COUHT«IHST x) 

000000000011001000000 (HOP) 



-xxxxxyyyyy 11001000000 (CLOCK x y) 



-xxxxx*»»»«l 1100000000 (EXCEPTIOH-SAVE x) 
-xxxxx*»««« 11101000000 (RESULTCODE-SAVE x) 
-xxxxx»»*«*lllllOOOOOO (STREAKXOOKAHEAD-SAVE x) 



-xxxxxyyyyyOO 100000001 (BITJLEFT-OHES-TEST x y) 



-xxxxxyyyyyOOlOlOOOOOl (BIT-LEFT-ZEROS-TEST x y) 

-xxxxxyyyyyOOl 10000001 (BIT-RIGHTjOHES-TEST x y) 

-xxxxxyyyyyOO 11 1000001 (BITJIIGHT-2ER0S-TEST x y) 

-xxxxxyyyyyOlOllOOOOOl (IHT-LOGB-TEST x y) 

-xxxxxyyyyyOl 100000001 ( FLOAT JIECIPJIPPROX-TEST x y) 

-xxxxxyyyyyOllOlOOOOOl (FLOAT-RSQRTJLPPROX-TEST x y) 



xxxxxyyyyyO 11 10000001 (IRTJIEC IP .SHIFT-TEST x y) 

xxxxxyyyyyOl 11 1000001 (UHS-REC IP-SHIFT-TEST x y) 

xxxxxyyyyyzzzzzOOOOlO (ROTATE-RIGHT x y z) 

xxxxxyyyyyzzzzzOOOOll (ROTATEJUGHT-TEST x y z) 

xxxxxyyyyyvwTvOOOlOC ( IHT JIDD_IMM x y value) 



xxxxxyyyyywvwOOOlOl (IHT-ADD-MLTEST x y value) 

xxxxxyyyyyrnr/wOOOllO (IHT-SUB-IMM x y value) 

^— xxxxxyyyyywvwOOOlii (Iirr^UBJ[Ml.TEST x y value) 

xxxxxyyyyyzxzxzOOllOO (FLOATJLDD x y z) 



— xxxxxyyyyy zzzzzOO 11 10 (FLOAT-SUB x y z) 

xxxxxyyyyyzzzzzOOllll ( FLOAT.CMP.TEST x y z) 

— xxxxxyyyyyzzzzzO 10000 (BITJJIMP x y z) 

— xxxxxyyyyyzzzzzOlOOOl (BITJIMP-TEST x y z) 
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xxxjcxyyyyyzzzzzOlOOlO (BIT-AND x y 2) 

: xxxxxyyyyyzzzzzOlOOll (BITJ^ND.TEST x y 2) 

xxxxxyyyyyzzzzzOlOlOO (BI7_X0R x y 2) 

xxxxxyyyyyzzzzzOlOlOl (BITJtOR-TEST x y z) 

xxxxxyyyyyzzzzzOlOUO (BITJDR x y 2) 

xxxxxyyyyyzzzzzOlOUl (BITJ3R-TEST x y 2) 

xxxxxyyyyyzzzzzOllOOO (SHirT-LEFT x y 2) 

xxxxxyTyyyzzzzzOllOOl (SHIFTJ-ErT-TEST x y z) 

xxxxxyyyyy 222220 11 010 (ROTATEJXFT x y 2) 

xxxxxyyyyyzzzzzO 11011 (ROTATEOZFT.TEST x y 2) 

xxxxxyyyyyzzzzzO 11100 (UNS-SHIFTJIIGHT x y z) 

xxxxxyyyyyzzzzzO 11 101 (UKS«SHIFTJIIGHT-TIST x y z) 

xxxxxyyyyyzzzzzDllllO (IHT-SHIFTJIIGHT x y z) 

xxxxxyyyyyzzzzzOlllll (IHT-SHIFTJIIGHT-TEST x y z) 

xxxxxyyyyyaaaaalOOOOO (PTR-SETJIC x y ac) 

xxxxxyyyyyzzzzzlOOOOl (UHS-ADD-CARRY.TEST x y z) 

oooooyyyyytttttlOOOlO (TRAP JIESTORE tr y) 

xxxxxyyyyyzzzzzlOOOll (UKS-SUB_CARRY.TEST x y z) 

xxxxxyyyyy222Z2l00100 (IHTJLDD x y 2) 

xxxxxyyyyy zzzzz 100 101 ( IHT JIDD^TEST x y 2) 

xxxxxyyyyyzzzzzlOOllO (IHT^UB x y z) 

xxxxxyyyyyzzzzzlOOllX (IHT^UBJTEST x y z) 

xxxxxOOOooooooOnOooo (SSHJ)ISP x offset) 

aaaaamaaccoooolllOooo (SKIP mask cn offset) 

xxxxxOOOoollllOUOttt (TARGETJSAVE x tn) 

DnaaBBmBCClllllllOttt (JUMP Bask cn tn) 

oooooOOlllOOOOOllOOOO (LEVEL£HTER lev) 

oooooOOllinilOnOttt (LEVELJITH lev tn) 

anaaBBaBccooooOlllooo (SKIPJ)rTEB aask cn offset) 

aBoaHBaaccoooollllooo (SKIPJSELDOH Bask cn offset) 

■■ ■ — aoaBaaasccllllOlllttt (JUHP-OFTEH aask cn tn) 

aoBBaBBaccllllllllttt (JUHPJSELDOH aaak cn tn) 



A.5 MAC CPs 

OOOOOOOOOOOOOOlOOOOcccccccccccccccccccccccccccccccccccccccccc (DEBUG cookie) 

— rmrOOOOOOOOOlll 11000000000000000000000000000000000000000000 (DEBUG JIEG r) 

— ooooooooooi Iiiooooia00100«»oooo00000000000ooooaooooo00000001010 (STREAHJQUIT) 

— ooooooooooiiiioooooOOlOOoooooOOOOlOOOOOOooooooooooOOOOlOOlOlO (STREAKjQUIT-P RESERVE) 

nTrroopQoiiiitttttOOllOoooooddOOOOOOOOOxxxxxosssssssddOOllOO (STREAfLCATCH r t x delay str) 

— rrrrroooooiiiitttttuuumiooooooooooOOlOllxxxxxyyyyyoooooOOlllO (STREAILCREATE-IMH r t n x y offset) 



A.6 I OPs 

OOOOOOOOOdaaOOOOuuuuuuuuOlllllllllllllllOOOOObbbbbbbbbbbbbbbbbbb ( IHST^EGMENT dist-en a-type unit liait 
base) 

0001000000000000000000000000000000000000000000000000000000000000 (LOAD-LIKK) 

OOOlOOOlOdaaOOOOuuuuuuuuOllllllUlllllllOOOOObbbbbbbbbbbbbbbbbbb (LOAD -SEGMENT distjen a-type unit liait 
base) 

0001001000000000000000000000000000000000000000000000000000000000 (LOAD-ERRJOFFSET) 

OOOIOOIIOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOCOOOOOOOOOOOOOOOOOOOO (LOADJ'LUSH) 

0001010000000000000000000000000000000000000000000000000000000000 (LOAD-LINK-DUT) 

0001010100000000000000000000000000000000000000000000000000000000 (LOAD-EKD-PACKET) 
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OOlOsssssssssssssssssssssssssssaeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeee (LOADJ)ATA start.offset «n<Lolfset) 
OOUsssssssssssssssssssssssssssseeeeeeeeeeeeeeeeeeeeeeeeeeeeeeee (LOADJMAGE start j)fiset encLoffset) 

0100000100000000000000000000000000000000000000000000000000000000 (oirTJ.i!nc) 
OlOOOlOOttttttttttttttttttttttttOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOO (OUTJ)ISCORHECT tineout) 
OlOOOlOlttttttttttttttttttttttttOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOO (OUT.CANCEL tiaeout) 
OlOOOlllttttttttttttttttttttttttOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOO (OUTJ)£UY tiaeout) 
OlOOOlUttttttttttttttttttttttttOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOl (OUTJIESET tiaeout) 

0100100000000000000000000000000000000000000000000000000000000000 (OUTJ.m_LOAD) 
OlOOlOOlttttttttttttttttttttttttssssaassassssssssssasassssssssss (OUTJ»ACKET tiaeout size) 

0100101000000000000000000000000100000000000000000000000000000001 (OirrXOOPMODE) 
OlOOlOlOttttttttttttttttttttttttOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOO (Otrr_LDOPBACK tiaeout) 
OlOOlllrttttttttttttttttttttttttOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOO (OUTJHTERCOHHECT width tiaeout) 
OlOsOOmtttttttttttttttttttttttllllllllllllllliiinilllllllllll (OUTJIIHG swap width tiaeout Ifield) 

1000000000000000000000000000000000000000000000000000000000000000 (STOR£LLIHX) 

lOOOOOOlOdaaOOOOuuuuuuuuOUlllllllinillOOOOObbbbbbbbbbbbbbbbbbb (STORE-SEGMEBTT diat^n m-type unit limit 
baae) 

1000001000000000000000000000000000000000000000000000000000000000 (STOIULFLUSH) 
lOOOOOaiOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOO (STORLXRRjDFFSET) 
1000010000000000000000000000000000000000000000000000000000000000 (STORE-LIHKaH) 
lOOOOlOlOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOO (STORE_EHDJ>ACKET) 
lOOOOllOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOO (STORELEHDJSEGMEBT) 

lOOlsssssasssssssssssasssssssssseeeeeeeeeeeeeeeeeeeeeeeeeeeeeeee (STOREJIEPLICATE startjoffset endjoflsct) 

lOlOsssasssssssssssssssssssssssseeeeeeeeeeeeeeeeeeeeeeeeeeeeeeee (STOR£J)ATA start-otlaet endjoflaet) 

lOHsssssasssssssssssssssssssasseeeeeeeeeeeeeeeeeeeeeeeeeeeeeeee (STOREOHAGE start jolfaet end^ffset) 

1100000100000000000000000000000000000000000000000000000000000000 (IHJ.IHK) 
llOOOOllttttttttttttttttttttttttOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOO (IH-LOOPBACK tiaeout) 
IIOOOIOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOO (IHJIEJECT) 
llOOOlOlttttttttttttttttttttttttOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOO (IHaiSTEN tiaeout) 
nOOOlllttttttttttttttttttttttttOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOO (INJ)EUY tiaeout) 
1100101100000000000000000000000000000000000000000000000000000000 (ISJ.IHKJSTORE) 
llOOlllwttttttttttttttttttttttttOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOO (IH-IHTERCaHHECT width tiaeout) 
llOslOOwttttttttttttttttttttttttOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOO (IHJICCEPT swap width tiaeout) 
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Appendix B: Processor State 



The foUowing table describes aU of the state information maintained bv a processor. The rows 
describe what state information is maintained and whether user, supervisor, and IPL privilege 
can directly read (abbreviated '^r") or v.rne (abbreviated '^w"), that state. The asterisk ("*") 
indicates that the state cannot be written, but can be indirectlv modified. Kernel level has the 
same capabilities as supervisor level. An unfilled entry is the same as the one above it 



per 


LEV.USER 


LEV^UPER 


LEVJPL 


number 


bits 


what 


reference 


stream 


rw 


rw 


rw 


1 


64 


stream status word 












1 


vH 


exception register 


§9.1 










1 


64 


result code register 


§9.1 










31 


64 


ceneral Durnose reeisters 












8 


32 


target refidsters 


82 2 






rw 




1 


16 


instruction count register 






- 


rw 


1 


4 


protection domain 


ZR 2 




- 


- 


- 


1 


2 


stream level 


§8.1 


Pd 


r* 


r* 


r* 


1 


36 


instruction issue counter 












1 


56 


memory reference counter 












1 


56 


stream counter 


8in 










1 


56 


concurrency counter 


§10 










4 


64 


selectable event counters 


§10.2 






rw 


rw 


1 


64 


data state descriptor 


§6.2 










1 


64 


program state descriptor 


§7.1 










16,384 


64 


data address map entries 


§6.2 




r* 






8,192 


64 


program address map entries 


§7 






r* 


r» 


1 


7 


stream reserved, SRES£? 


§2 










1 


7 


stream current, scvkd 


§2 


proc 


rw 


rw 


rw 


384 


64 


trap registers 


§9.2 










512 


64 


data control registers 


§6.3 










512 


64 


data value registers 


§6.3 




r 


r* 


r* 


132 


32 


program address TLB entries 


§7 










1024 


64 


data address TLB entries 


§6.2 




r 


r 


rw 


256 


32 


reciprocal table 










256 


32 


reciprocal square root table 






r 


r 


r 


1 


56 


phantom counter 


§10.2 










1 


56 


ready counter 


§10.2 










1 


64 


clock 


§10 
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Appendix C: GF(2) Addressing Matrices 



C.l Scraunbling Matrices 

This is the GF(2) matrk used for address scrambling: 



1011000011000110101 

1010010001111010101 

0011001110000001010 

0000011100010110010 

0001110100001001010 

0101000110001010001 

0100100000011100010 

0100010010010001110 

0110011110110001010 

0011111110001111010 

1000110100000101101 

0110110111110111001 

0011011000011101110 

0001111010110110010 

0 0 0 0 1 0.0 0 00 0 0 1 1 1 1 1 0 1 

0000011001111100110 

0000001000000110010 

0000000100010101110 

0000000010001100001 

0000000001111010101 

0000000000110010001 

OOOOOOOOOOOllOOlOOl 

0000000000001110010 

0000000000000111001 

0000000000000010001 

0000000000000001100 

OOOOOOOOOOOOOOCOlOO 

OOOOOOOOOOOOOOOOOlO 

0000000000000000001 



This is the GF(2) inverse matrix: 
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1000111101010101011 

0111011100100001010 

0011100010110011101 

0001110011100001000 

0000100000001001011 

0000011001000000001 

0000001000000101111 

0000000100011010100 

0000000010001010010 
0000000001101101011 
0000000000111110010 
0000000000011100010 
0000000000001101111 
0000000000000111100 
0000000000000010001 
0000000000000001100 
0000000000000000100 
0000000000000000010 
0000000000000000001 

This is the data single-bit error syndrome table. If a s}Tidrome is found in this table, then the 
bit 4*Tow + col is in error. If the syndrome is zero, there is no error. Otherwise, there is 
uncorrectable error. Bits 71-64 are the ecc bits. 



0x80 0x40 0x20 0x10 
0x08 0x04 0x02 0x01 
0xc6 Oxel 0xe2 Oxdl 
0xc9 0xd2 0xe4 OxeS 
0xd4 Oxca 0xc5 OxcS 
0x86 Oxal 0xa2 0x91 
0x89 0x92 0xa4 0xa8 
0x94 0x8a 0x85 0x83 
0x8c OxbO OxaO 0x90 
0x88 0x84 0x82 0x81 
0x46 0x61 0x62 0x51 
0x49 0x52 0x64 0x68 
0x54 0x4a 0x45 0x43 
Ox4c 0x70 0x60 0x50 
0x48 0x44 0x42 0x41 
0x06 0x21 0x22 0x11 
0x09 0x12 0x24 0x28 
0x14 OxOa 0x05 0x03 



This is the access state single-bit error syndrome table. If a syndrome is found in this table, then the 
bit row is in error. If the syndrome is zero, there is no error. Otherwise, there is an uncorrectable 
error. Bits 7-4 are the ecc bits. Bits 3-0 are the access state. 
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0x8 
0x4 
0x2 
0x1 
0x7 
Oxb 
Oxd 
Oxe 
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A 

A-operation 15, 17, 52, 147 

A-unit 15, 46, 50, 137 

AccessState (struct) 23 

access control 10, 23, 24, 29. 32. 55, 114, 121, 

134, 138, 142, 154, 155, 157, 159, 160, 

161, 162, 212, 227 
access state 10, 23, 29, 33, 55, 144, 146, 154, 

155, 156, 157, 158, 204, 207, 243 
active 12. 219, 228 

address 10. 12, 14, 23, 24, 25. 26, 28, 29, 30, 
31, 33, 35, 36, 37, 38, 39, 40, 41, 44, 
45, 48, 55, 56, 80, 81, 83, 138, 139, 
140. 153, 194, 204, 207. 212, 230, 242 

address translation 23, 24, 26, 33, 200. 206 

align 26, 154, 155, 194 

allsig (field in ProgramStateDescriptor) 41 

Aop (field in Operation) 15 

AJoat_result_code (field in ResultCode) 46 

A Joat jesultjeg (field in ResultCode) 46 

B 

bad^onnect (field in lOPStatusWord) 194. 214, 
224 

base (field in DataStateDescriptor) 9, 10, 28, 
30, 39, 41, 53, 89, 119, 125, 178, 198, 
200, 206 

BIT_A.ND.TEST (operation) 58 

BIT_A.ND (operation) 58 

BITJMP.TEST (operation) 59 

BITJMP (operation) 59 

BIT_LEFT.ONES.TEST (operation) 60 

BIT_LEFT_ONES (operation) 60 

BIT_LEFT.ZEROS.TEST (operation) 60 

BIT_LEFT-ZEROS (operation) 60 

BIT_MASK (operation) 61 

BITJvIAT.OR (operation) 62 

BIT_MAT.TRANSPOSE (operation) 62 

BIT.MATJCOR (operation) 62 

BIT_MERGE-TEST (operation) 63 

BIT_MERGE (operation) 63 

BIT_\AND.TEST (operation) 64 



BITJCAND (operation) 64 
BIT_\IMP_TEST (operation) 65 
BIT_\IMP (operation) 65 
BIT_NOR.TEST (operation) 66 
BIT_NOR (operation) 66 
BIT.ODDj\.ND.TEST (operation) 67 
BIT.0DD.AKD (operation) 67 
BIT.ODDJ^'IMP.TEST (operation) 67 
BIT.ODD_NIMP (operation) 67 
BIT_ODD-OR.TEST (operation) 67 
BIT.ODD.OR (operation) 67 
BIT.ODD_XOR.TEST (operation) 67 
BIT.ODD_XOR (operation) 67 
BIT.OR-TEST (operation) 68 
BIT.OR (operation) 68 
BIT_PACK (operation) 69 
BITJaCHT-ONES.TEST (operation) 70 
BITJaCHT-ONES (operation) 70 
BITJUGHTJEROS.TEST (operation) 70 
BIT_RIGHT_ZEROS (operation) 70 
BIT.TALLY.TEST (operation) 71 
BIT-TALLY (operation) 71 
BIT.UNPACKJ (operation) 72 
BIT.UNPACK.2 (operation) 72 
BIT.UNPACK-3 (operation) 72 
BITJCNOR.TEST (operation) 73 
BIT_XNOR (operation) 73 
BIT_XOR.TEST (operation) 74 
BITJCOR (operation) 74 
branch 11, 13, 15, 16, 17, 48, 131, 152, 153, 
169 

BREAK (operation) 75 

bytejoffset (field in DataAddress) 26 

byte 9, 23, 24, 25, 26, 30, 31, 33, 62, 116, 138, 

154, 155, 159, 175, 194. 204, 207, 212, 

222 

C 

C-operation 15, 17, 52 
C-unit 15, 46, 50, 52, 137 
carry bit 17, 20 

ccJ) (field in StreamStatusWord) 12 
ccl (field in StreamStatusWord) 12 
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cC-2 (field in StreamScaiusWord) 12 
ccJ (field in StreamStaiusWord) 12 
chop 86. 109, 171. 190 
CLOCK (operation) 51. 76. 137 
CNT_A._NOP (enum in CountSource) 50, 137 
CNT.CONCURRENCY (enum in CountSource) 
51 

CNT.CREATE (enum in CountSource) 51, 166 
CNT.C_NOP (enum in CountSource) 50, 137 
CNTJFLOAT_ADD (enum in CountSource) 50, 

87, 88, 90, 100, 106, 107 
CNT_FLOAT_DrV' (enum in CountSource) 50, 

91 

CNT_FLOATJ^UL (enum in CountSource) 50, 

88, 100, 107 
CNTJ'LOAT-SQRT (enum in CountSource) 

50, 105 

CNT J"LOAT_TOTAL (enum in CountSource) 

50, 87, 88, 90, 91, 92, 93, 96, 97, 98, 
99. 100, 103, 104. 105, 106, 107 
CNTJNT-FETCH^DD (enum in CountSource) 
50, 114 

CNTJSSUES (enum in CountSource) 51 
CNTJUMP-EXPECTED (enum in CountSource)'' 
50. 131, 152 



condition code 11. 12, 17. 19, 20, 55, 90, 92, 
93. 96. 97. 98. 99. 103, 110, 111, 128, 
129, 131. 135, 136. 152 
condition mask 17. 131, 135, 136, 152 
CondMask (enum) 17. IS. 19, 131, 135, 136. 
152 

COND JJEG.C (enum in CondCode) 17 
COND_NEG_N'C (enum in CondCode) 17 
COND.OVFNAN.C (enum in CondCode) 17 
COND-OVFNAN_NC (enum in CondCode) 17 
COND_POS.C (enum in CondCode) 17 
CONDJ»OS_NC (enum in CondCode) 17 
COND_ZERO.C (enum in CondCode) 17 
CONDJERO-NC (enum in CondCode) 17 
connect Jost (field in lOPStatusWord) 194, 214, 

215, 222. 225, 226, 227 
Cop (field in Operation) 15 
counter 10, 38, 39, 40, 49, 50, 51, 77, 163, 165 
CountSource (enum) 49, 50, 51, 77 
COUNT.CONCURRENCY (operation) 49, 77 
count_disable (field in StreamStatusWord) 13, 

49 

COUNT-EVENTS (operation) 50, 77 
COUNTJSSUES (operation) 49, 77 
COUNT-MEiMREFS (operation) 49, 77 



CNTJUMP.UNEXPECTED (enum in CountSource)^^^'^'^-^^-^-'^'^^^^ (operation) 79 

^COUNTJIEADY (operation) 79 
COUNT.SELECTJIESTORE (operation) 49, 



50. 131, 152 

CNT_LEVEL (enum in CountSource) 50, 132 
CNTJ.OAD (enum in CountSource) 50, 116, 

117, 118, 133, 134. 144, 154, 175, 176, 

177 

CNT_MEMREFS (enum in CountSource) 51 
CNT-MEMJIETRY (enum in CountSource) 
50 

CNTJ^_NOP (enum in CountSource) 50, 137 
CNT.QUIT (enum in CountSource) 51, 167 
CNT.STORE (enum in CountSource) 50, 146, 

155, 157, 158, 159. 160, 161, 162 
CNTJSTREAMS (enum in CountSource) 51 
CNT.TARGET (enum in CountSource) 50. 169 
CNT.TRANSFER.TOTAL (enum in CountSource) 

50. 131, 152 
CNT.TRAP (enum in CountSource) 48, 51 
code generator 15 
complex 56 

concurrency counter 49. 77, 241 
CondCode (enum) 12, 17 



77, 78 

COUNT^ELECT^AVE (operation) 49, 78 
COUNT.STREAMS (operation) 49, 77 
create (field in ExceptionRegister) 44, 45, 83, 
166 

create exception 44, 83, 166 

cv (condition code) 12, 17, 131, 135, 136, 149, 

152, 153, 172 
C_floatjresult-code (field in ResultCode) 46 
C_float Jesuit j-eg (field in ResultCode) 46 



D 



DataAddress (struct) 10, 24, 25, 26 
DataAddrUns (a base type) 10, 24, 31 
DataControlDescriptor (struct) 31 
DataFrame (a base type) 10, 26, 39 
DataMapEntry (struct) 26 
DataResultCode (enum) 46, 47 
DataSegment (a base type) 10, 26, 39 
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DataStaieDescripior (struct) 39 
daia-aiignment (field in ExceptionRegister) 13, 

44, 45, 81, 114. 116, 117. 118. 121. 
133. 134. 144, 146. 156. 157. 158. 159. 

160. 161, 162. 175. 176. 177 
data_blocked (field in ExceptionRegister) 44, 

45, 81. 114. 116, 117. 118, 121. 133, 
134. 144. 146, 156, 157, 158, 159, 160, 

161, 162, 175, 176, 177 
dataJrame-offset (field in DataAddress) 26 
data_bw.error (field in ExceptionRegister) 44, 

45, 114, 116, 117, 118, 121, 133, 134, 
144, 146, 154, 155, 156, 157, 158, 159, 
160, 161, 162, 175, 176, 177 
DATA_MAP J"LUSH_A.NY (operation) 28. 32, 
80 

DATA_MAPJFLUSH (operation) 28, 32. 80 
DATA-OPA^AVE (operation) 31, 32. 47. 81 
DATA-OPD^AVE (operation) 31, 32, 47, 81 
DATA.OP-REDO (operation) 31, 47, 81 
data_prot (field in ExceptionRegister) 39, 44, 
45, 80, 81, 114, 116. 117, 118. 121, 
133, 134, 138, 144, 146, 154, 155, 156, 
157, 158, 159, 160, 161, 162, 175, 176. 
177 

data^egmentjiumber (field in DataAddress) 

26 

data^egmentjofFset (field in DataAddress) 26 
DATA^TATEJIESTORE (operation) 32, 39, 
80, 82 

data address unimplemented 30, 47 

data address 26, 28, 29, 35, 80, 198, 241 

data alignment 25, 26, 32, 47 

data control register 31, 32, 241 

data map limit 25, 26, 32, 39, 47, 80 

data map 10, 25, 26, 28, 29, 30, 36, 38, 39, 55, 

80, 198, 200, 206 
data memory retry exception 32, 39, 43, 45, 

47 

data memory 10, 23, 24, 25, 26, 28, 29, 30, 32, 

33, 36, 39, 41, 43, 45, 47, 50, 80, 81 
data protection 25, 29, 30, 32, 43, 45, 47, 138 
data result 44, 46, 47, 81, 163 
data segment number 10, 26, 39 
data segment 10, 25, 26, 29, 32, 47 
data state descriptor 25, 26, 38, 39. 82, 241 
data trap bit 22, 25, 154, 155 
data trap disable 25, 157 



data trap 23. 24. 31, 43, 45. 47. 157. 194, 204 
data value register 31. 32, 241 
debugger 49. 75 

denormdize 21. 46, 86, 102, 119. 126, 188 
depend 15. 16 

destjreg (field in DataControlDescriptor) 31, 
32. 81 

distribution-enable (field in DataMapEntry) 

26, 29, 36 
distributor 10, 26, 29, 36 
DomainDataAddress (struct) 28 
DomainDataTLB.A.ddress (struct) 28 
DomainProgramAddress (struct) 35 
DOMAIN JENTER (operation) 83 
DOMAD»'_IDENTIFIER_SAVE (operation) 38, 

84 

DOMAIXJLEAVE (operation) 38, 83 
domain_signal-trap.disable (field in StreamSta- . 

tusWord) 12 
domain-signal (field in ExceptionRegister) 44, 

45 

domain 25, 26, 28, 29, 35, 38, 39, 40, 41, 49, 
80, 82, 83, 138, 140, 141, 164, 230 

domain signal 12, 38, 41, 43, 45 

doubled precision 21, 88, 100, 185, 186 

drO (field in ResultCode) 46, 147 

dr7 (field in ResultCode) 46 

DR_DAT.A.-ALIGNMENT (enum in DataRe- 
sultCode) 47 

DR_DATA.TRAP01 (enum in DataResnitCode) 
47 

DRJ)ATA_TRAP0 (enum in DataResultCode) 
47 

DRJ)ATA.TRAP1 (enum in DataResultCode) 

47 

DR_1ATENCYJ-IMIT (enum in DataResult- 
Code) 47 

DR-MAPJ,IMIT (enum in DataResultCode) 

47 

DR_NONE (enum in DataResultCode) 47 
DR_PROTECTIONJ.EVEL (enum in DataRe- 
sultCode) 47 
DRJIETRYjLIMIT (enum in DataResultCode) 
47 

DR_SEGMENT_LIMIT (enum in DataResult- 
Code) 47 

DR_UNCORRECTABLE^RROR (enum in DataRe- 
sultCode) 47 
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DILUNIMPLEMENTED.IDDRESS (enum in 

DataResultCode) 47 
DR.UNIMPLEMEXTED.OP (enumin DataRe- 

suJtCode) 39, 47, 81 
d (protection domain) 38. 77. 78, 83. 84. 164 

166 . 

E 

effective address 24, 25, 26 
endian 23 

enumeration member 9 
enumeration name 9 
EventSelect (struct) 49 
event counter 49, 50, 53, 77, 78, 241 
ExceptionRegister (struct) 45, 46 
EXCEPTIONJIESTORE (operation) 43, 85 
EXCEPTION^AVE (operation) 43, 85 
exception 10, 11, 12. 13, 14, 15, 20, 21, 23, 
25, 26, 29, 30, 31, 32, 36, 38, 41, 43^ 
44, 45, 46. 47, 48, 49, 51, 53, 55, 75, 
81, 85, 86, 91, 92, 100, 102. 105, 109, 
112, 113, 123, 130, 132, 138, 140, 145, 
146, 149, 157, 165, 170, 171, 173, 174, 
194, 198, 199, 200, 201, 205, 208, 210, 
211,212,213,214.215,217, 218,219, 
220, 221, 222. 223, 224, 225, 226, 227, 
228, 229, 230 
Exception (enum) 43, 44, 45, 84, 194 
exception (exception status register) 85 164 
166 

exception register 11, 32, 43, 44 , 47, 85, 166. 
241 

execJevel (field in ProgramMapEntry) 33, 36 
exponent 20, 92, 93. 102, 103, 113, 119, 174 
Ex_Create (enum in Exception) 43 
Ex_Data_AJignment (enum in Exception) 43 
Ex_Data_Blocked (enum in Exception) 43 
Ex_DataJIWXrror (enum in Exception) 43 
Ex_Data_Prot (enum in Exception) 43 
ExJ)omain^ignal (enum in Exception) 43 
Ex_Float .Extension (enum in Exception) 43 
Ex_Float_Inexact (enum in Exception) 44 
ExJ'loai Jnvalid (enum in Exception) 44 
Ex-Float_Overflow (enum in Exception) 44 
Ex_Float.Underflow (enum in Exception) 44 
Ex_Float^eroJ)ivide (enum in Exception) 44 
ExJnstruction.Couni (enum in Exception) 43 



Ex_Poison (enum in Exception) 43 
Ex_Privileged (enum in Exception) 43 
ExJProgJIW Jlrror (enum in Exception) 43 
Ex_ProgJProt (enum in Exception) 43 

F 

fe.control (field in DataControlDescriptor) 23, 

24, 31, 116, 117, 118, 133, 134, 154, 
157, 175, 176, 177 

FE JUTURE (enum in FullEmpty Control) 24, 

25, 154, 157 

FE_NORMAL (enumin FuIlEmptyControl) 24, 
25, 32, 116, 117, 118, 133, 157, 175, 
176, 177 

FE_SYNC (enumin FuIlEmptyControl) 24, 25, 

154, 157 
field name 9 
Float32 (struct) 20 
Float64 (struct) 20, 92, 103, 124 
FloatResultCode (enum) 46 
FloatSelect (enum) 19 
FLOAT_A.DD J^UL (operation) 50, 88 
FLOAT_A.DD (operation) 50, 52, 87 
FLOAT_\PPROXJlESTORE (operation) 89 
FLOAT.CEIL (operation) 21. 86 
FLOATXHOP (operation) 86 
FLOATXMP.TEST (operation) 20, 90 
FLOATJDIV^PPROX (operation) 92 
FLOATJ)IVJERROR (operation) 93 
FLOAT_DrV (operation) 50, 91 
float.extension (field in ExceptionRegister) 21, 

45, 46, 102, 112, 113, 123, 130, 173, 
174 

FLOATJPLOOR (operation) 86 
floatJnexact_trap.disable (field in StreamSta- 

tusWord) 12 
float Jnexact (field in ExceptionRegister) 45, 

46, 86, 87, 88, 91, 92, 94, 100, 104, 
105, 106, 107, 108, 109, 143, 171 

FLOATJNT (operation) 94 
floatJnvalid_trap_disable (field in StreamSta- 

tusWord) 12 
floatJn^-aiid (field in ExceptionRegister) 20, 

21, 45, 46, 87, 88, 90, 92, 106, 107, 

109, 171 

FLOATJTER (operation) 91, 95, 188 
FL0AT31AX-TEST (operation) 96 



GF(2) Addressing Matrices 



249 



FLOAT_MAX (operation) 20. 96 
FLOAT-MIN.TEST (operation) 19. 20. 97 
FLOAT_MIN (operation) 20. 97 
FLOAT_MMAX.TEST (operation) 98 
FLO.\T_MMAX (operation) 9S 
FLO AT_MMI.\ .TEST (operation) 99 
FLOATJvIMIN (operation) 99 
FLOAT_VIUL_LO\VER (operation) 100 
FLOAT-NEAR (operation) 86 
floatjoverfiow.trap.disable (field in StreamSta- 

tusWord) 12 
floatjoverflow (field in ExceptionRegister) 45, 

46, 87, 88, 91, 92, 100. 104, 106. 107, 

143 

FLOATJIEAL (operation) 101. 143 

FLOAT_RECIP_A.PPROX.TEST (operation) 

102 

FLOAT-R-ECIP-APPROX (operation) 46, 102, 
188 

FLO AT_RECIPJ;RR0R (operation) 103, 188 
FLOAT-ROUND (operation) 86 

FLOATJISQRT.APPROX_TEST (operation) 
102 

FLOATJLSQRT^PPROX (operation) 46, 102 
FL0AT_RSQRTJ;RR0R.TEST (operation) 103 
FLOAT.SCALB (operation) 104 
FL0AT.SQRT.APPR0X.TEST (operation) 92, 
188 

FL0AT.SQRTJ:RR0R_TEST (operation) 93 
FLOAT.SQRT (operation) 50. 105 
FLOAT.SUB J^ULJLEV (operation) 100. 107 
FLOAT.SUB J4UL (operation) 107 
FLOAT.SUB (operation) 50, 106 
float-underflow _trap_djsable (field in Stream- 

StatusWord) 12 
float.underflow (field in ExceptionRegister) 45, 

46, 88, 91, 100, 104, 107, 143 
FLOAT.TJNS (operation) 108 
float jero.divide (field in ExceptionRegister) 

21, 45, 46, 92 
float jero.div.trapjiisable (field in StreamSta- 

tusWord) 12 
float 18, 19, 43, 45, 46, 50, 86, 94, 101, 108, 

109, 119, 143, 171, 188 
float inexact 12, 44, 45 
float invalid operation 44. 45 
float invalid 12, 45 
float overflow 12, 44, 45 



float point 9. 21. 87. 88. 90. 91. 92. 93. 95, 96, 

97. 98. 99, 100. 102. 103. 104, 105, 
106. 107. 123. 124, 126. 143 

float underflow 12. 44. 45 

float zero divide 12. 44. 45 

flop 180. 182 

forcedJink (field in lOPStatusWord) 194. 204 
format 9, 20, 52. 101, 102, 198, 200, 206, 230 
forward.enable (field in AccessState) 23, 154, 
157 

forward 13, 23, 24, 25. 31, 49, 50, 116, 117, 
118, 133, 138, 144, 146, 152, 154, 155, 
156, 157, 158, 175, 176. 177, 194, 204, 
212 

forward bit 25 

fraction (field in Float32) 20, 21, 22 
frame.offset (field in ProgL2Address) 35 
frame number 10, 33, 36 
FR_DQ (enum in FloatResultCode) 46 
FRJDR (enum in FloatResultCode) 46 
FRJDZ (enum in FloatResultCode) 46 
FR_FG (enum in FloatResultCode) 46 
FRJ'X (enum in FloatResultCode) 46 
FRJM (enum in EoatResultCode) 46 
full/empty 23, 24, 25, 31, 154, 157, 204 
FuUEmptyControl (enum) 23, 24, 31 
full (field in AccessState) 23, 24, 25, 138, 144, 

146, 154, 155, 156, 157, 158, 194, 212 
fwd.disable (field in DataControlDescriptor) 

23, 24, 31, 154, 157 

G 

general purpose register 55, 166, 241 
global memory 30 

H 

halfword 9, 10, 23, 24, 117, 160, 176 
hardware.trap.disable (field in StreamStatus- 

Word) 12 
hardware error 30, 43, 45 

I 

identifier 53, 84, 163, 166 
idle 12, 198 

IEEE754 floating point 9, 17, 20, 21, 86, 109, 
171 
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IF_0 (enum in CondMask) 18 

IFJ (enum in CondMask) 18 

IF J (enum in CondMask) 18 

IFJ (enum in CondMask) 18 

IF.4 (enum in CondMask) 18 

IF^ (enum in CondMask) 18 

IFJ (enum in CondMask) 18 

IF.7 (enum in CondMask) 18 

IF_A.LWAYS (enum in CondMask) 17 

IF.CY (enum in CondMask) 18 

IFJ;Q (enum in CondMask) 17, 18 

IF-FGE (enum in CondMask) 18 
IF_FGT (enum in CondMask) 18 
IF_FLE (enum in CondMask) 18 
IF_FLT (enum in CondiVIask) 18 
IFJFUN (enum in CondMask) 18 
IF_F (enum in CondMask) 18 
IFJGE (enum in CondMask) 18 
IFJGT (enum in CondMask) 18 
IFJLE (enum in CondMask) 18 
IFJLT (enum in CondMask) 18 
IFJMI (enum in CondMask) 18 
IFJMZ (enum in CondMask) 18 
IFJOV (enum in CondMask) 18 
IFJPL (enum in CondMask) 18 
IFJPZ (enum in CondMask) 18 
IFJiC (enum in CondMask) 18 
IFJfEVER (enum in CondMask) 17 
IF_NE (enum in CondMask) 18 
IFJ^Z (enum in CondMask) 18 
IF.T (enum in CondMask) 18 
IF.UGE (enum in CondMask) 18 
IF.UGT (enum in CondMask) 18 
IF-ULE (enum in CondMask) 18 
IF.ULT (enum in CondMask) 18 
IF-ZE (enum in CondMask) 18 
iUegaljjp^ode (iield in lOFStatusWord) 194 
instruction-count (field in ExceptionRegister) 
44,45 

instruction 10. 12, 13, 15, 16, 17, 24, 33, 35, 
36, 40, 43, 44, 45, 47, 49, 51, 52, 53. 
55. 85, 139, 147, 153. 164, 165, 166, 
169, 180, 182, 190. 192. 194. 195, 199, 
201, 203. 204, 205. 207, 208, 213, 214, 
216, 218, 220, 221, 222, 223, 225, 226, 
229, 230. 233, 241 
instruction counter 13, 47, 49, 164, 165, 166 
instruction issue counter 49, 77, 165. 241 



INSTJSEGMENT (operation) 198 
integer 9. 17, 18, 19. 46. 55. 86, 94, 95, 104. 
109, 110. 111. 112. 113, 119, 120, 122! 
123, 124, 125. 128, 129, 130. 172, 173, 
174, 178, 190, 192 
interconnect Jost (field in lOPStatusWord) 194, 

214, 224 
IntSelect (enum) 19 
IXT^DDJMM_TEST (operation) 111 
INT^DD JMM (operation) 55, 60, 111 
INTJlDD_MUL.TEST (operation) 112 
INT^DDJ^UL (operation) 112, 173 
INTJIDD.TEST (operation) 110 
INT_A.DD (operation) 60, 110 
INT.CEIL_TEST (operation) 109 
INT.CEIL (operation) 109 
INTXHOP.TEST (operation) 109 
INTXHOP (operation) 21, 109 
INT_DIV.CHOP-TEST (operation) 113 
INT_DIV.CHOP (operation) 113, 190 
INT-DIVJ-LOOR-TEST (operation) 113 
INTJ)IV_FLOOR (operation) 113, 190 
INTJPETCH^DDJICJ)ISP (operation) 114 
INTJ-ETCH^DDjVCJNDEX (operation) 114 
INTJETCH^DD-DISP (operation) 114 
L\TJ"ETCH_A.DD JNDEX (operation) 114 
INTJLOOR.TEST (operation) 109 
INT_FL0OR (operation) 109 
INTJMM (operation) 115, 137 
INTJ<OADBJlCJ)ISP (operation) 116 
INT_LOADB_ACJNDEX (operation) 116 
INTJ,OADBJ)ISP (operation) 116 
INT-LOADBJNDEX (operation) 116 
INT-LOADB (operation) 31, 116 
INT-LOADH-AC_DISP (operation) 117 
INT_LOADH_ACJNDEX (operation) 117 
INT-LOADH_DISP (operation) 117 
INTJ,OADH_rNDEX (operation) 117 
INT_LOADH (operation) 31, 117 
INT_LOADQ_AC_DISP (operation) 118 
INT_LOADQ^CJNDEX (operation) 118 
INTJ.OADQ_DISP (operation) 118 
INT_LOADQJNDEX (operation) 118 
INT_LOADQ (operation) 31, 118 
INTJLOGB.TEST (operation) 119 
INT_LOGB (operation) 119 
INTJ^AX.TEST (operation) 120 
INT_MAX (operation) 17, 120 
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INT_\1EM_ADD_'IC_DISP (operation) 121 
IXT_ME.M_A.DD_'^CJNDEX (operation) 121 
INT_MEM_A.DD_DISP (operation) 121 
IXT_MEM_A.DDJNDEX (operation) 121 
IXT-MIN.TEST (operation) 122 
IXT-MIN (operation) 122 
I.VT_\EAR-TEST (operation) 109 
INTJS'EAR (operation) 109 
INTJIECIP-APPROX (operation) 123 
INT_RECIP -ERROR (operation) 124 
IXTJIECIP_SHIFT-TEST (operation) 125 
INTJIECIP^HIFT (operation) 125 
INTJ10UND-.TEST (operation) 109 
INTJIOUND (operation) 109 
INTJlSQRT_A.PPROX (operation) 126 
INTJHIFTJUGHT-TEST (operation) 127 
INTJHIFTJUGHT (operation) 127 
INT_SUB JMM.TEST (operation) 129 
INT^UB JMM (operation) 129 
INT_SUBJvlUL-REV.TEST (operation) 130 
INTJUBJ^ULJLEV (operation) 130 
INT_SUB^UL-TEST (operation) 130 
INT^UB_MUL (operation) 130 
INT_SUB.TEST (operation) 128 
INT^UB (operation) 128 
invalid operation 21 
IN_A.CCEPT (operation) 226- 
IN-DELAY (operation) 229 
INJNTERCONNECT (operation) 228 
IN_LINK^TORE (operation) 223 
IN_LINK (operation) 223 
IN_LISTEN (operation) 224 
INXOOPBACK (operation) 215, 227 
IN_REJECT (operation) 225 
lOPStatusWord (struct) 194, 195 
lopStream (enum) 195 
lOP JN (enum in lopStream) 195 
lOPXOAD (enum in lopStream) 195 
lOP-OUT (enum in lopStream) 195 
lOP^TORE (enum in lopStream) 195 

J 

JUMP.OFTEN (operation) 16, 131 
JUMPJELDOM (operation) 16, 131 
JUMP (operation) 13, 14, 50, 55. 60, 131, 169 

K 



KERNEL level 12. 38, 51. 164. 241 

L 

LlAddress (struct) 36 
L2.\ddress (struct ) 36. 37 
la (field in Operation) 15 
LEVEL_ENTER (operation) 38, 50, 132 
LEVEL_RTN (operation) 38, 47, 132, 169 
level 12, 15, 25, 26. 29, 32, 33. 36, 38, 39, 40, 

41, 45, 47, 48, 50, 55, 80, 132, 138, 

164, 198, 200, 206, 241 
Level (enum) 26, 33, 38, 39, 40, 41, 138, 169 
level (stream level) 132, 138, 164, 166 
LEVJPL (enum in Level) 38, 138, 241 
LEVJCERNEL (enum in Level) 38 
LEV^UPER (enum in Level) 38, 48, 241 
LEV.USER (enum in Level) 38, 138, 241 
limit_error (field in lOPStatusWord) 194, 201, 

204, 207, 208, 212 
limit (field in DataStateDescriptor) 12, 25, 29, 

30, 32, 33, 39, 41, 45, 47, 198, 200, 

206 

lineJndex (field in LlAddress) 35, 36, 37 
linkjerror (field in lOPStatusWord) 194 
LOAD_ACJ)ISP (operation) 133 
LOAD_ACJNDEX (operation) 133 
LOADJ)ATA (operation) 204 
LOADJ)ISP (operation) 133 
LOAD_ENDJ>ACKET (operation) 203, 222 
LOAD_ERR.OFFSET (operation) 201 
LOAD JE (operation) 134 
LOAD JLUSH (operation) 202 
LOADJMAGE (operation) 204, 222 
LOADJNDEX (operation) 133 
loadJevel (field in DataMapEntry) 26, 29 
LOADJ,INK_OUT (operation) 199, 215, 222 
LOAD_LINK (operation) 199 
LOAD-SEGMEKT (operation) 200 
LOAD (operation) 23, 24, 31, 49, 50, 60, 114, 

121, 133, 207, 222, 231 
locality 29 
local memory 139 

locked (memon- ceD) 23, 25, 26, 28, 155, 163, 

198, 200. 206 
LOGICAL_\LLONE.TEST (operation) 135 
LOGICAL^LLONE (operation) 135, 136 
LOGICAL-ONE.TEST (operation) 136 
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LOGICAL.ONE (operation) 135, 136 
logical.unit (field in DaiaMapEntry) 26, 29 
logical unit number 26. 29, 30, 198 
logical unit offset 26, 29. 33, 36 
lookahead-disable (field in StreamStatusWord) 
13. 16 

lookahead 12. 13, 15, 16, 31, 43, 47, 52, 81, 
83, 131, 132, 147, 152, 163, 167, 180, 
182 

loopback (field in lOPStatusWord) 195, 207, 
215, 216, 217, 227 

M 

M-operation 15, 47, 52 

M-unit 15, 16, 23, 31, 32, 46. 50, 55. 81, 137, 
163 

MAC-operation 15, 16, 44 
map base 33 

map entry 25, 26. 28, 33, 35, 36, 241 
map limit 25, 45, 80, 138, 140 
matrix 29, 55, 62, 242 
MC-operation 15 

memory .type (field in DataMapEntry) 26, 30 
memory reference counter 49, 77, 241 
memory retry 13, 25, 29, 30. 31, 39, 44, 47, 

49, 198 
memory system 9, 30, 36 
memory unit 10, 29, 30 
min_dkill (field in DataStateDescriptor) 39, 80 
min.pkill (field in ProgramStateDescriptor) 40, 

48 

min-psleep (field in ProgramStateDescriptor) 
40 

Mop (field in Operation) 15 

N 

NaNResultCode (enum) 21, 22 

NAN JNF J)IVJNF (enum in NaNResultCode) 
22 

NAN JNF JUB JNF (enum in NaNResultCode) 

22 

NAN.SQRT_\EG (enum in NaNResultCode) 
22 

NAN_ZEROJ)IV.ZERO (enum in NaNResult- 
Code) 22 

NAN_ZERO_MUL JNF (enum in NaNResult- 
Code) 22 



NaN 17, 18. 20, 21, 22, 90, 93, 96, 97, 98, 99. 

100, 101, 119, 143 
negative 10, 17. 22, 90, 92. 96, 97, 98, 99, 119, 

171, 190 
nonstandard 17 
NOP (operation) 50, 52, 137 
normalize 21 

no.connection (field in lOPStatusWord) 194, 
222 

0 

OPAJDATA.0PA^AVE (enum in RetryOp- 
Code) 32 

OPAJ)ATA-OPD.SAVE (enum in RetryOp- 
Code) 32 

OPAJ)ATA_STATEJElESTORE (enum in Retry- 

OpCode) 32 
OPAJNT_FETCH-.A.DD (enum in RetryOp- 

Code) 31 

OPAJNTJ.OADB (enum in RetryOpCode) 
31 

OPAJNTJ-OADH (enum in RetryOpCode) 
31 

OPAJNTJ,OADQ (enum in RetryOpCode) 
31 

OPAJLOAD (enum in RetryOpCode) 31 
OPA^IAP JLUSHJ^NY (enum in RetryOp- 
Code) 32 

OPAJ^AP JLUSH (enum in RetryOpCode) 
32 

OPAJ»ROBE (enum in RetryOpCode) 31 
OPAJIEG J-OAD (enum in RetryOpCode) 31 
OPA_SCRUB J-OAD (enum in RetryOpCode) 
31 

OPA_STATEJ,OAD (enum in RetryOpCode) 

31 

OPA.STATEJ,OCK (enum in RetryOpCode) 
31 

OP.\^TATE.STORE (enum in RetryOpCode) 

31 

OPA.STOREB (enum in RetryOpCode) 31 
OPA^TOREH (enum in RetryOpCode) 31 
OPA.STOREQ (enum in RetryOpCode) 31 
OPA.STOREJEMPTY (enum in RetrvOpCode) 

31 

OP-A_STORE (enum in RetryOpCode) 31 
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OPA.STREAM.CATCH (enum in RetrvOp- 
Code) 32 

OPAJTREAM.CREATE (enum in RetrvOp- 
Code)32 

OPA.UXS_LOADB (enum in ReirvOpCode) 

31 

OPA.l'.\S_LOADH (enuna in RetrvOpCode) 
31 

OPA-UXSJ.OADQ (enum in RetrvOpCode) 
31 

operating system 37, 38, 39, 43, 48, 167, 168 
OperationAccessControl (struct) 24 
OPERATIONU (operation) 53 
OPERATIONS (operation) 53 
Operation (struct) , 15, 24, 52, 53, 55. 57, 59. 
61. 63, 65, 67, 69, 71, 73, 75, 77, 79. 
81. 83, 85, 87, 89, 91, 93, 95^ 97, 99, 
101, 103, 105, 107, 109, 111, 113, 115. 
117, 119, 121, 123, 125, 127, 129. 131, 
133, 135, 137, 139, 141, 143, 145, 147, 
149, 151, 153, 155, 157, 159, 161, 163, 
165, 167, 169, 171, 173, 175, 177, 179, 
194, 197, 199, 201, 203, 205, 207, 209, 
211, 213, 215, 217, 219, 221, 223. 225, 
227, 229, 233, 235, 237. 239 
os_field (field in lOPStatusWord) 194 
OUT-CA^'CEL (operation) 218 
OUTJDELAY (operation) 220 
OUT_DISCONNECT (operation) 217, 227 
OUTJNTERCONNECT (operation) 219 
OUT_LINK J.OAD (operation) 213 
OUT_LmK (operation) 213 
OUT_LOOPBACK (operation) 215, 216, 227 
OUT_LOOPMODE (operation) 216 
OUTJ>.\CKET (operation) 194, 203, 222 
OUTJIESET (operation) 221 
OUTJONG (operation) 214 
overflow/NaN 17, 58, 59, 60, 63, 64, 65, 66, 
68, 70, 71, 73, 74, 90, 96, 97, 98, 99, 
109, 112, 119, 120, 122, 127, 130, 148, 
149, 150, 151, 168, 171, 173, 179 



packet.end (field in lOPStatusWord) 194, 208, 
212 

PageNumber (a base type) 10, 33, 41 
C.l Scrambling Matrices 



pc (field in lOPStatusWord) 13. 14, 33, 44, 
47, 83, 131. 132. 152. 164. 166. 169^ 
194, 195, 201. 208. 224. 231 

pc (program counter) 15 

pfl (field in ExceptionRegister) 46 

pf3l (field in ExceptionRegister) 46 

phantom counter 51, 79, 241 

physical unit number 29, 30 

physical unit offset 29. 33 

pointer 9, 10 , 23 . 24, 25, 55, 134, 138. 142. 
154, 155, 166 

Pointer (struct) 23, 24. 141, 143, 145 

poison 11, 32, 43, 44, 45, 46, 51, 81, 144, 145, 
146, 149, 170 

positive 13, 17, 90, 93, 96, 97, 98, 99, 119 

precedence 40, 56 

privilege 12, 15, 25, 29. 36, 38, 39, 40, 41, 43, 
44, 45, 47, 48, 55, 75, 78, 80, 82, 83, 
84, 132, 138, 139, 140, 141, 164, 167, 
169, 241 

piiv_quit (field in ProgramStateDescriptor) 41, 
167 

priv.t0 (field in ProgramStateDescriptor) 14, 

41, 48, 169 
ProbeControI (enum) 138 
PROBEJ)ISP (operation) 138 
PROBE JNDEX (operation) 138 
ProgFrame (a base type) 10, 33, 41 
ProgL2Address (struct) 35 
ProgramLAddress (struct) 10, 13. 33 
ProgramAddrUns (a base type) 10, 13, 35 
ProgramMapEntry (struct) 33 
ProgramStateDescriptor (struct) 40, 41 

PROGRAMXACHEJ-LUSH-^NY (operation) 

37, 139 

PR0GRAM.CACHEJ-LUSHJL1 (operation) 
37, 139 

PROGRAM-CACHEJ-LUSH (operation) 37, 
139 

PROGRAMJilAP J'LUSH^NY (operation) 35, 
140 

PROGRAM-MAP JLUSH (operation) 35, 140 
PROGRAM^TATEJIESTORE (operation) 40, 
141 

program address 11, 13, 14, 33, 35, 36, 38, 44, 
140, 241 

program counter (PC) 10, 11, 12, 13, 14, 15, 
36, 169, 180, 194 
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program map limit 33. 41 
program map 10, 33. 35. 36, 38, 41, 132. 140 
program memory 10, 33. 35, 36. 37 
program protection 33. 36. 43, 45. 132 
program segment 198 

program state descriptor 33, 38. 39. 40, 48. 

141, 167, 169, 241 
ProgTlbAdd- (struct) 35 
progjrame (field in ProgramMapEntrv) 33, 

35, 36 

prog_hwjerror (field in ExceptionRegister) 44, 
45 

prog4)agejiumber (field in ProgramAddress) 

33 

prog.page.offset (field in ProgramAddress) 33 
prog.prot (field in ExceptionRegister) 44, 45 
48, 132 

protection domain 12, 14. 26, 33, 38, 39, 40. 

41. 48, 49, 55, 77. 82, 83, 84, 141. 163, 

165, 166, 167, 168. 241 
PTR^ET^C (operation) 142 
p_limit_error (field in lOPStatusWord) 194, 198 
P-MODIFY (enum in ProbeControl) 138 
-PJIEAD (enxim in ProbeControl) 138 
p.uncorr€ctabl€.error (field in lOPStatusWord) 

194, 198 

p-unimplemented-address (field in lOPStatus- 
Word) 194, 198 

Q 

quarterword 9, 23, 24, 118, 161, 177 

R 

ready counter 51, 79, 222. 241 

REALJ-LOAT (operation) 143 

reciprocal 89, 91, 95, 102. 103, 105, 123, 126, 

188, 190, 191, 241 
REG-LOAD.\CJ)ISP (operation) 144 
REGJ,OAD_A.CJNDEX (operation) 144 
REG_LOAD_DISP (operation) 144 
REG_LOADJNDEX (operation) 144 
REG_MOVE (operation) 32. 60. 145 
REG^TORE_^CJ)ISP (operation) 146 
REG.^TORE_\CJNDEX (operation) 146 
REG-STOREJDISP (operation) 146 
REGJTOREJ.XDEX (operation) 146 



Reg (a base type) 10. 31, 46 
Resource (enum) 26. 30, 39, 49, 51, 164, 165. 
167 

restop (field in DataControIDescriptor) 31 
RESULTCODE^AVE (operation) 147 
ResultCode (struct) 46 
result code (result code register) 147, 164, 166 
result code register 11, 21, 43, 46, 47. 81, 85, 

147, 166, 241 
result code 21, 22, 46, 47, 81 
RES_DMEM (enum in Resource) 30 
RESJOM (enum in Resource) 30 
RESJOP (enum in Resource) 30 
RetryOpCode (enum) 31, 32 
retry Jimit (field in DataStateDescriptor) 39 
RNDXEIL (enum in RoundMode) 21 
RND.CHOP (enum in RoundMode) 21 
RNDJLOOR (enum in RoundMode) 21 
RND_NEAR (enum in RoundMode) 21 
ROTATE-LEFT.TEST (operation) 148 
ROTATEJ,EFT (operation) 148 
ROTATEJIIGHT.TEST (operation) 148 
ROTATE_RIGHT (operation) 148 
rounding mode 13, 21, 86, 94, 108, 109, 171, 
185 

RoundMode (enum) 13, 21 

roundjnode (field in StreamStatusWord) 13, 

21 

round 21, 86, 109, 113, 125, 171, 174, 178, 185, 
188, 218 

round to nearest 21, 92, 93, 95, 103, 124, 185 



scramble 26, 29, 33, 36 
scur (streams in use) 39, 83, 163, 166, 167, 241 
SegmentOffset (a base type) 10, 26 
segment.base (field in DataMapEntry) 26, 29 
segmentJimit (field in DataMapEntr}-) 26, 29 
segment-ofFset (field in DomainDataAddress) 
28 

segment base 26, 29 
segment limit 25, 26, 29, 45 
SELECTJ-LOAT.TEST (operation) 149 
SELECT_FLOAT (operation) 19, 149 
S ELECT JNT.TEST (operation) 149 
SELECTJNT (operation) 19, 149 
seLO (field in EventSelect) 49 
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sel_l (field in EventSelect) 49 
sel_2 (field in EventSelect) 49 
sel_3 (field in EventSelect) 49 
SEL.CY (enum in IntSeleci) 19. 149 
SEL J;Q (enum in IntSelect) 19, 149 
SEL_FGE (enum in FloatSelect) 19, 149 
SEL_FGT (enum in FloatSelect) 19, 149 
SEL_FLE (enum in FloatSelect) 19, 149 
SELJLT (enum in FloatSelect) 19, 149 
SEL_FU.\ (enum in FloatSelect) 19, 149 
SEL JGE (enum in IntSelect) 19, 149 
SEL JGT (enum in IntSelect) 19, 149 
SELJPL (enum in IntSelect) 19, 149 
SEL JPZ (enum in IntSelect) 19, 149 
SEL.UGE (enum in IntSelect) 19, 149 
SEL.UGT (enum in IntSelect) 19, 149 
setJndex (iield in DomainDataTLBAddress) 
28 

set jiumber (field in DomainDataAddress) 28. 

35. 36, 37 

SHIFT J-EFTJMM.TEST (operation) 150 
SHIFTJLEFTJMM (operation) 150 
SHIFTJLEFT.TEST (operation) 150 
SHIFT J,EFT (operation) 150 
SHIFT_PAIILLEFT-TEST (operation) 151 
SHIFTJ>AIILLEFT (operation) 151 
SHIFT_PAnLRIGHT_TEST (operation) 151 
SHIFT_PAIILRIGHT (operation) 151 
significand 20, 143 

sign (field in noat32) 17, 20, 88, 93, 107, 116, 

117, 118, 190 
sign bit 20, 56, 127 
SKIP .OFTEN (operation). 16, 152 
SKIP^ELDOM (operation) 16, 152 
SKIP (operation) 13, 50, 152 
sleep 40 

slim (field in ProgramStateDescriptor) 41 
slim (stream count limit) 39, 41, 168 
SpecialFloat64 (struct) 20, 91, 92, 95, 102, 

103, 105, 113, 123, 124, 126, 174 
specJoad-enable (field in StreamStatusWord) 

13,32 

square root 20, 56, 89, 92, 93, 95, 103, 105, 

126, 188, 241 
sres (streams reserved) 39, 49, 83, 163, 166, 

167, 168, 241 
SSW_DISP (operation) 153 



ssw.override (field in StreamStatusWord) 13, 

25. 47, 132 
SSWJIESTORE (operation) 153 
ssw (stream status word) 10, 11. 12. 14, 16, 

21. 25, 26. 32, 43. 44. 47, 49, 83, 85, 

86. 94. 108, 109, 131, 132, 152. 153, 

164, 166, 169, 171 
stall (field in DataMapEntry) 26, 29, 48 
standard 16, 20, 21 
STATE J<OAD_DISP (operation) 154 
STATEJ-OADJNDEX (operation) 154 
STATEJ.OCK.ACJDISP (operation) 155 • 
STATEJ-OCKJ^CJNDEX (operation) 155 
STATEJ-OCKJDISP (operation) 155 
ST.ATE_LOCKJNDEX (operation) 155 
STATE^CRUB JDISP (operation) 156 
STATE.SCRUBJNDEX (operation) 156 
STATE.STORE.AC_DISP (operation) 157 
STATE_STORE^CJNDEX (operation) 157 
STATE.STORE_DISP (operation) 157 
STATE^TORE-ERROR-DISP (operation) 158 
STATE_STORE JIRRORJNDEX (operation) 

158 

STATE^TOREJNDEX (operation) 157 
statnsJink (field in lOPStatusWord) 194, 201, 
208, 224 

STOREB_ACJDISP (operation) 159 
STOREB_ACJNDEX (operation) 159 
STOREBJ)ISP (operation) 159 
STOREBJNDEX (operation) 159 
STOREB (operation) 31, 159 
STOREH_AC-DISP (operation) 160 
STOREH_A.CJNDEX (operation) 160 
STOREHJDISP (operation) 160 
STOREHJNDEX (operation) 160 
STOREH (operation) 31, 160 
STOREQ_AC-DISP (operation) 161 
STOREQJVCJNDEX (operation) 161 
STOREQJ)ISP (operation) 161 
STOREQJNDEX (operation) 161 
STOREQ (operation) 31, 161 
STOREJlCJ)ISP (operation) 162 
STOREjiXJNDEX (operation) 162 
STORE-DATA (operation) 210, 212, 223 
STORE-DISP (operation) 162 
STGREJIND J>ACKET (operation) 210 
STORE_END.SEGMENT (operation) 211 
ST0REJ:RR_0FFSET (operation) 208 
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STORE_FLUSH (opemion) 209 
STORE JMAGE (operaiion) 210, 212. 223 
STOREJNDEX (operation) 162 
storeJevel (field in DataMapEntry) 26. 29 
STOREJLINKJN (operation) 205. 224. 227, 
231 

STORE_LINK (operation) 205 
STORE-REPLICATE (operation) 207, 223 
STORE^EGMENT (operation) 206 
STORE (operation) 23, 24, 31, 49, 50, 55, 114, 

121, 162, 207, 224 
StreamStatusWord (struct) 12, 13, 169 
STREAM.CATCH (operation) 32, 164 
STREAM-CO UNTJNSTJIESTORE (opera- 
tion) 49, 165 
STREAM.COUNTJNST (operation) 49, 165 
STREAM.CREATEJMM (operation) 166 
STREAM.CUR_SAVE (operation) 39, 163 
STREAMJDENTIFIER_SAVE (operation) 163 
StreamJdentity (field in lOPStatusWord) 195 
STREAM JLOOKAHEAD^AVE (operation) 163 
STREAM.QUIT_PRESERVE (operation) 167 
STREAM.QUIT (operation) 12, 15, 39, 40, 

41, 48, 51, 166, 167 
STREAM_RESERVE.TEST (operation) 168 
STREAMJlESERVE.UPTO.TEST (operation) 
168 

STREAMJIESERVE-UPTO (operation) 168 
STREAMJIESERVE (operation) 12, 39, 166, 
168 

STREAM JIES-SAVE (operation) 39, 163 

stream 10, 11, 12, 14, 15, 17, 23, 25, 29, 31, 
33, 36, 38, 39, 40, 41, 43, 44, 45, 47, 
48, 49, 51, 55, 77, 79, 81, 83, 132, 163, 
165, 166, 167, 168, 182, 194, 195, 198, 
199, 200, 201, 202. 203, 204, 205, 206, 
207, 208, 209, 210, 212, 213, 214, 215, 
216, 217, 222, 223, 224, 226, 227, 230, 
231, 241 

stream counter 49, 77, 241 

stream create exception 43, 45 

stream level 45, 241 

stream number 163 

stream status word (saw) 10, 11, 12, 33. 194, 
241 

structure name 9 

subblockJndex (field in L2Ad dress) 35, 37 
supervisor level 12. 29, 38. 78, 241 



synchronization 25 

system.trapjiisable (field in StreamStatusWord) 

12 • 
system trap 12 
S (total stream count) 168 

T 

tag (field in DomainDataAddress) 28, 35, 36, 

37, 49, 50, 77 
TARGETJDISP (operation) 60, 169' 
TARGETJNDEX (operation) 169 
TARGET_RESTORE (operation) 153, 169 
TARGET.SAVE (operation) 50, 169 
target register TO 14, 41, 44 , 47, 48, 164, 166 
target register 11, 14, 16, 33, 50, 131, 153, 166, 

169, 241 

timeout (field in lOPStatusWord) 194, 214, 
215, 217, 218, 219, 220, 221, 222, 224, 
226, 227, 228, 229, 231 

tn (target register) 131 

trapO-dlsable (field in DataControIOescriptor) 
24, 31 

trapO.enable (field in AccessState) 23, 154, 
157 

trapOioadjdisable (field in Pointer) 23, 154, 

157 

trapO-storejdisable (field in Pointer) 23, 154, 
157 

trapLdisable (field in DataControlDescriptor) 

24, 31 

trapLenable (field in AccessState) 23, 154, 
157 

traplioadjdisable (field in Pointer) 23, 154, 
157 

trapl.store.disable (field in Pointer) 23, 154, 
157 

TRAPJIESTORE (operation) 32, 48, 170 

TRAP.SAVE (operation) 48, 170 

trap 11, 12. 13, 14, 15, 21, 25, 29, 40, 43, 44, 
46- 47, 48, 51, 81, 85, 138, 144, 146, 
153, 154, 156, 157, 158, 166, 188, 212 

trap handler 11, 14, 21, 22, 31, 41, 43, 44, 47, 
48, 81. 166, 170, 188 

trap mask 11, 12, 43, 47, 85, 131 

trap register TRO 48 

trap register 11, 48, 166, 167, 170, 241 
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U 

unaJigned.data..enable (field in SlreamStatus- 

Word) 13. 25, 26 
unaligned data 13, 43. 45 
uncorrectable.error (field in lOPStatusWord) 

194. 20L 204, 207, 208, 210, 211, 212, 

222, 226 

uncorrectable program memory error 43, 45 
unimplemented-address (field in lOPStatus- 
Word) 194, 201, 204, 207, 208, 212 
unimplemenied operation 47 
unsigned integer 9, 15, 20, 55, 108, 110, 133, 

134, 168, 171, 173, 174, 190 
l'NS_A.DD.CARRY-TEST (operation) 172 
UNS_A.DD_NIUL.UPPER_TEST (operation) 173 
U^'S_A.DDJv^UL.UPPER (operation) 173 
UNS-CEIL_TEST (operation) 171 
UNS-CEIL (operation) 171 
UNS.CHOP-TEST (operation) 171 
UNS-CHOP (operation) 171 
UNSJDIV.TEST (operation) 174 
UNS_DrV (operation) 174 
UNS_FLOOR.TEST (operation) 171 
UNS-FLOOR (operation) 21, 171 
UNSJLOADB_A.CJDISP (operation) 175 
UNSXOADBJ\.CJNDEX (operation) 175 
UNS_LOADBJ)ISP (operation) 175 
UNSJ-OADBJNDEX (operation) 175 
UNS-LOADB (operation) 31, 175 
UNSXOADHJ^C-DISP (operation) 176 
UNS_LOADH_AC_LNDEX (operation) 176 
UNSXOADH-DISP (operation) 176 
UNS XOADHJNDEX (operation) 176 
UNS-LOADH (operation) 31, 176 
UNSXOADQ_A.C-DISP (operation) 177 
UNSXOADQJICJNDEX (operation) 177 
UNSXOADQ_DISP (operation) 177 
UNSXOADQJNDEX (operation) 177 
UNSXOADQ (operation) 31, 137, 177 
UNS JsEAR-TEST (operation) 171 
UNSJS'EAR (operation) 171 
UNS_RECIP_SHIFT-TEST (operation) 178 
UNS-RECIP^HIFT (operation) 178 
UNS-ROUND.TEST (operation) 171 
UNS-ROUND (operation) 171 
UNS^HIFTJUGHT.TEST (operation) 179 
UNS^HIFT_RIGHT (operation) 179 



UNS^UB.CARRY.TEST (operation) 172 
user.trapjiisable (field in StreamStatusWord) 

12 

user trap 12 

w 

word 9, 10, 23, 24, 25, 30, 33, 36, 53, 55, 60, 
62, 70, 81, 114, 121, 133, 134, 139, 
144, 146. 148, 150, 155. 156, 157, 158, 
162, 194, 195, 198. 199, 201, 203, 204, 
205, 207, 208, 212, 213, 214, 222, 223, 
224, 226, 231, 233 

Z 

zero divide 46 
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